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proteinaVgonudeotides and oHgonuc.eo.ide ana.ogs denved from 
polynucleotides and ribozyn.es including same. mli _ tion sha „ 

Citation or identification of any reference m thts app , canon shah 
„o, be construed as an admission that such reference ,s ava.lable as pnor 
art to the present invention. 

SKTSTSC- — units consisting of 
uronic acid and a hexosamine. Biosynthesis of OAGs except hyaluron, 
ZZ s nfia^ from a core protein. Proteoglycans may conta.n several 
C3AG side chains from similar or different families. OAGs » 
. !s homopolymers which may subsequently be modified by N-deacetylauon 
° ndZ.fa.ion, followed by C5-epirnerizafion of* ucurontc „ .o 
iduronic acid and O-sulfation. The chemical compost.ton of GAGs from 
various tissues varies highly. ■ 

The natural metabolism of GAGs tn ammals ,s earned out by 

after fusion with lysosome, where desulfatton and exolyttc 
characterized so far are ,hc hyaluronidases. The hyaluronidases are a 

35 rri^^lb:— £ , P andHya, 2 were cloned 
1" put.ished (27). These enzymes share an overall homology *40* 
and have different tissue specificities, cellular locahzattons and PH 



optima. 
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Exolytic hydrolases are better characterized, among wh.ch are f> 
glucoronidase, a-L-iduronidase, and p-N-acetylglucosaminidase. In 
addition to hydrolysis of the glycosidic bond of the polysaccharide cham 
GAG degradation involves desulfation, which is catalyzed by 
lysosomal sulfatases such as N-ace Vl galac.osamine-4-sulfa.as=, iduronate- 
2-snlfatase and heparin sulfamidase. Deficiency in any of lysosomal GAG 
degrading enzymes results in a lysosomal storage disease, 
mucopolysaccharidosis. 

Glycosyl hydrolases: 

Glycosyl hydrolases arc a widespread group of enzymes that 
hydrolyze the o-glycosidic bond between two or more carbohydrates or 
bleen a carbohydrate and a noncarbohydratc moiety. The enzymatic 
hydrolysis of glycosidic bond occurs by using major one or two 
mechanisms leading to overall retention or inversion of the anomenc 
configuration. In both mechanisms catalysis involves two »■*■«; » 
proton donor and a nucleophile. Glycosyl hydropses have been ^sstfied 
into 58 families based on amino acid similarities. The glycosyl hydropses 
from families 1, 2, 5, 10, 17, 30, 35, 39 and 42 ac, on a large variety o 
substrates, however, they all hydrolyze the glycoside bond m a general 
. acid catalysis mechanism, with retention of the anomenc configurafiom 
The mechanism involves two glutamic acid residues, which are the pro on 
donors and the nucleophile, with an aspargine always preced.ng the proton 
donor. Analyses of a set of known 3D structures from this group revealed 
that their catalytic domains, despite the low level of sequence identity, 
, 5 adopt a similar (o/p) 8 fold with the proton donor and the nucleophile 
located at the C-.erminal ends of strands P 4 and |)7, respective^ 
Mutations in the functional conserved amino acids of lysosomal glycosyl 
hydrolases were identified in lysosomal storage diseases. 

Lysosomal glycosyl hydrolases including ^-glucuronidase p- 
30 manosidase, p-glucocenebrosidase, p-galac.osidase and a-L-iduron.dase, 
arc all exo-glycosy. hydrolases, belong to the OH-A elan and share a 
similar catalytic site. However, many endo-glucanascs from variou 
organisms, such as bacterial and fungal xylenases and cellulases share this 

catalytic domain (1). 
35 Heparan sulfate proteoglycans (HSPGs) 

HSPGs are ubiquitous macromolecules associated w.th the cell 
surface and extracellular matrix (ECM) of a wide range of cells of 
vertebrate and invertebrate tissues (3-7). The basic HSPG structure 
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consists of a protein core to which several linear heparan sulfate chatns are 

r^ently Iched. The polysaccharide chains are ^a, 

repeating hexuronic and D-glncosamine dtsacchar.de untu that are 

— - TtT£?o n r — » .°t E cm 

neurite outgrowth and tissue repair \j r 

Cha ns which are uniqne in their ability to bind a multttude of protems, 

8) HSPGS are also prominent components of blood vessels 5). In large 
vessels they are concentrated mostly in the intima and * 
wh ^as t capillaries thcy are fonnd mainly in the subendotheha 
« , membrane where they support proliferating and mtgrahng 
■ e^doThelial cells and stabilize the structure of the captllaty wall. The 
IhC o HSPOS to interact with ECM macromolecules such as collagen 
and fibronectin, and with different attachment sues on ptan 
membranes suggests a key role for Otis proteoglycan ,n , e self, emb 

ItS: r S« and cancer metastasis 
Resting that enzymes which degtade HS play tmportant roles ,„ 
25 pathologic processes. 

* a glycosylated enzyme that is involved in the 
ca,aho.ism P of certain glycosaminoglycans. U is an — 
that cleaves heparan sulfate a, specif.e intracham sues (12-15) nteractton 
,„ of T and B lymphocytes, platelets, granulocytes, macrophages and m^t 
el, with Jsubendothelia, extracellular matrix (ECM, - £ 
degradation of heparan sulfate by heparanase acttvuy (16) Connecuve 
tissue activating peptide .1. (CTAP), a c-chemoktne, was 
heparanase-like activity. Placenta heparanase acts - an adh s on 
35 molecule or as a degrada.ive enzyme dependtng on the pH of 
microenvironvent (17). 

Heparanase is released from intracellular compartments (e.g., 
lysosomes, specific granules) in response to various activation stgnals 
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(eg thrombin, calcium ionophorcs, immune complexes, antigens and 
mitogens), suggesting its regulated involvement in mflammauon and 
cellular immunity responses (16). released 
It waS also demonstrated that heparanase ean be reachly released 
from human neutrophils by 60 minutes incubation at 4 C m the absence of 

added stimuli (18). fx„ n H in 

Gelatinase, another ECM degrading enzyme whtch ts found tn 
tertiary granules of human neutrophils with heparanase, ,s secreted 
riuophils in response to phorbo. ,2-myris.ate ,3-acetate (PMA) 

"contrl various rumor cdis appear ,o express and secrete 
heparanase 7n a constitutive manner in corre.ation with their metastat.c 

^"'Zltion of heparan sulfate hy heparanase results in *e release 
, of heparin-binding growth factors, enzymes and P'«'^ ^ 
sequestered by heparan sulfate in basement membranes, extracellular 
matrices and cell surfaces (22-23). 

Heparanase activity has been described in a number of cell types 
including cultured skin fibroblasts, human neutrophils, activated rat T- 

human placenta and human platelets. ■ 

A procedure for purification of natural ^-"^ £ 
SK hepatoma cells and human placenta (U.S. Pat. No. 5,362,641) and for 
25 human platelets derived enzymes (62). 

Cloning and expression oj the heparanase gene 
A pmified fraefion of hepamnase isolated from human epatom 
cells was subjected to tryptic digestion. Peptides were separat by htgh 
pressure liquid chromatography (HPLC) and mtcro sequenc ^ The 
,„ sequence of one of the peptides was used to screen data bases for 
Zofogy to the corresponding bade translated DNA sequence Tta 
Ledum led to the identification of a clone containing an tnsert of !020 
*Z X bp) which included an open reading frame of 963 bp followed 
b ; 7 bp of ' untranslated region and a poly A mil. The new gene , w- 
35 designated npa. Cloning of the missing 5' end of Hpa was performed by 
Marathon RACE from placenta cDNA composite. Ihe jomed kpa cDNA 
( a,so referred to as « fragment contained , ^ ^ 
which encodes a polypeptide of 543 amino ac.ds wt.h a calculated 
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■ v.* «f *i 1Q2 daltons (2). The cloning procedures are 
08/922,170,09/109,386, 

described m length in U.S. Pat. appncai Tseg/17954. 
and 09/258 892, the letter is a continuat.on-.n-part of PCT/US98/17934, 
Wed August 31 1998. all of which are incorporated here,„ by reference. 

Z L mic locus which encodes heparanase spans about 0 Kb. » 
is composed of 12 exons separated by U introns and ,s locahzed on 

h "^~;of the npa gene product to catalyze degradation of 

^ ; i X. For this purpose, cel. lysates were incubated w,th su fate 
ECM-derived HSPQ Cpeax ,), followed by gel ftltrabon analys, 
, Z h-e 6B, of the reaction mixture. While the subsbate atone 
■ * a „f h\oh molecular weight material, incubation of the HM o 
consisted of high J wim containing 

iSin Tl1r^— Twas demonstrated that the heparanase 
enzyme expressed* cells infected with a pFApa ™ ^ ' 
delding HS complexed to other macromolecular constituents (e*. 
25 fib™ el, laminin, collagen) present in a naturally produced .n,a« ECM 
Tee U S Pat. application No. 09/109,386, which is mcorporated herem by 
in a manner similar to that reported for highly metastanc tumor 
cells or activated cells of the immune system (7, 8). 

Zferenta, expression of ,„e npa g ene in numan breas, ana 

M ^tZZZ^^ was applted to evaluate the expression 
of the LTgene by human breast carcinoma cel. lines exhtbttmg Afferent 
de ^of metastasis. A marked increase in npa gene _ express,™ - 
observed whieh correia.es to metastatic capacity of non-metas ta„c MCF-7 
35 "arcinoma, moderately metastatic MDA 23! and h^r. y m— 
S 435 breast carcinoma cell lines. Significantly, the d.fferent.al 
ZL of the npa gene expression correlated with the pattern of 
heparanase activity. 
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Expression of the hpa gene in human breast carcinoma was 
demonstrated by in situ hybridization to archival paraffin embedded 
human breast tissue. Hybridization of the hcparanase antisense nboprobe 
to invasive duct carcinoma tissue sections resulted in a massive positive 
5 staining localized specifically to the carcinoma cells. The hpa gene was 
also expressed in areas adjacent to the carcinoma showing fibrocystic 
changes. Normal breast tissue derived from reduction mammoplasty failed 
to express the hpa transcript. High expression of the hpa gene was also 
observed in tissue sections derived from human hepatocellular carcinoma 
,o specimens but not in normal adult liver tissue. Furthermore, tissue 
specimens derived from adenocarcinoma of the ovary, squamous cell 
carcinoma of the cervix and colon adenocarcinoma exhibited strong 
staining with the hpa RNA probe, as compared to a very low staining of 
the hpa mRNA in the respective non-malignant control tissues (2). 
15 A preferential expression of heparanase in human tumors versus the 

corresponding normal tissues was also noted by immunohistochemical 
staining of paraffin embedded sections with monoclonal anti-heparanase 
antibodies. Positive cytoplasmic staining was found in neoplastic cells of 
the colon carcinoma and in dysplastic epithelial cells of a tubulovillous 
20 adenoma found in the same specimen while there was little or no staining 
of the normal looking colon epithelium located away from the carcinoma. 
Of particular significance was an intense immunostaining of colon 
adenocarcinoma cells that had metastasized into the liver, as compared to 
the surrounding normal liver tissue. 
25 Latent and active forms of the heparanase protein 

The apparent molecular size of the recombinant enzyme produced 
in the baculovirus expression system was about 65 kDa. This heparanase 
polypeptide contains 6 potential N-glycosylation sites. Following 
deglycosylation by treatment with peptide N-glycosidase, the protein 
30 appeared as a 57 kDa band. This molecular weight corresponds to the 
deduced molecular mass (61,192 daltons) of the 543 amino acid 
polypeptide encoded by the full length hpa cDNA after cleavage of the 
predicted 3 kDa signal peptide. No further reduction in the apparent size 
of the N-deglycosylated protein was observed following concurrent O- 
35 glycosidase and neuraminidase treatment. Deglycosylation had no 
detectable effect on enzymatic activity. 

Unlike the baculovirus enzyme, expression of the full length 
heparanase polypeptide in mammalian cells (e.g., 293 kidney cells, CHO) 
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r> <— — '= d « ,he A 50kD 1 a T« ™ 

Vfold more active than .he 65 kDa form. A -™ ^f^™ 
observed when the specific activity of the recombtnan. 65 kDa bacnloy.rus 
was compared to that of the 50 kDa , 
purified from human piatelets, SK-hep-i ce.ls, or P>-"*J^ " ' ^ 
suggest that the 50 kDa protein is a mature processed fom. of a latent 
heparanase precursor. Amino terminai sequencmg of the platelet 
heparanase indicated tha, cieavage occurs between ammo acds glu 5 
y sV As indicated by the hydropathic plot of heparenase, Uu , «*. 
located within a hydrophi.lic peak which is likeiy to be exposed and hence 
5 accessible to proteases. r„ va *i„„ and 

Involvement of Heparanase in Tumor Cell Invasion 

e s must degrade the subendotheha, glycoproteins and P™£*£^ 
25 the BM in order to migrate out of the vascular compartment Several 

elastasc etc) are thought to be involved m degradation of BM (25). 
rung these en^mes is heparanase ttta, cleaves HS a. specific mtracham 
Z. (.6 ID- Expression of a HS degrading heparanase was found to 
3„ chelate vih the metastatic potential of mouse lymphoma 26) 
3 Z 11 and tndanoma (21) cells. Moreover, Cevatec lev* o 
heparanase were detected in sera from metric tumor *-°*~£> 
and melanoma patients (2, ) and in tuntor biopsies 

The inhibitory effect of various non-anucoagnlanl speces of 
35 heparin on heparanase was examined in view of their potent.a. use n 
" vILg ex— on of blood-bome ceH, Treatment . cxpenm enta 
animals with heparanase inhibitors markedly reduced (> 90 /o) the 
Tdence of lung metastases induced by BI6 —a, Lews lung 
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carcinoma and mammary adenocarcinoma cells (12, .3 28). Hepann 
7Z ionTwim high and low affinity .o anti-.hromb,n 111 exlubned a 
comZble high an.i-metasta.ic active indicating that the heparanase 
STJ^y of heparin, rather than its anticoagulant plays a 

ro ,e in the anti-metastatic properties of the po.ysaechar.de (.2^ 

The direct role of heparanase m cancer metastasis wa 
demonstrated by two experimental systems. The murine T-lymphoma eel 
"h s no detectable heparanase activity. Whether — 
he Hpa gene into Eb cells would confer a metastatic behav.o on Jhese 

mock transfected Eb cells were injected subcutaneously mto DBA/2 m ce 
Id mice were tested for survival time and Hver metres. All m e 
- (n-20) injected with mock transfected cells survtved durmg the .4 
weeks of the experiment, while 50% mortals was observed 
r„oeu ated with Eb cells transfected with the Hpa cDNA. The hver of m,ce 
inocolaS whh Hpa transfected cells was infiltrated with numerous Eb 
Zhoma cells as was evident both by macroscopic evaluahon of the 
, 0 Ce mnd microscopic examination of tissue section. In = 
metastatic lesions could not be detected by gross exammat.on of the hver 
Tf ^i novated with mock transfected control Eb cells. Few or no 
lloma cells were found to infiltrate the liver tissue. In a different 
Tdd of « metastasis, transient transfection of the heparanase ^ 
25 To low metastatic B16-FI mouse melanoma cells followed by ..v. 
. ♦ united in a 4- to 5-fold increase in lung metastases. 
^X^Z^ externally adhered to B16-F, melanoma cells 
increased the level of lung metastases in C57BL 

. i r«>e US Pat. application No. 09/260,Ui/, enuueu 

30 SS^STa ^oc,ca L P mate RI ae into , A = , 

which is a continuation in par. of U.S. Pat. appheauon No. 09/140,888, 
and is incorporated herein by reference. 

Possible involvement of Heparanase in tumor angtogene s 
fibroblast growth factors are a family of structural., f re nted 
35 polypeptides characterized by high affinity to hepann (29). They ^ 
nighW mitogenie for vascular endotttelial cells and are among the most 
potent Tnducers of neovascularization (29-30). Basic fibroblast growth 
Co ,bFGF) has been extracted from a subendothelia, ECM produced „ 
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W,,o (31) and from basement membranes of .he eornea (32) suggesung 
thaT ECM may serve as a reservoir for bFGF. Immunoh.stochem.ca 
ta ning «v"aL ,he .ocalization of bFGF in basemen, membranes of 
dWe«e .iles and blood vessels (23). Despite the ubiquitous presence of 
bFGF in normal tissues, endothelial eel, proliferation in the, t.ssnes , 
usually very ,ow, suggesting that bFGF is s— * 
site of action. Studies on the mteracuon of bFGF with ECM reve 
bFGF binds to HSPG in the ECM and ean be released ,n an active form by 
bFGrbmosto lt wa s demonstrated that heparanase 

stvoived n release of active bFGF from ECM and basemen, membranes 
3 T suggesting tha. heparanase activity may no. only function m cell 
l^oT and 8 —, bu. may also elicit — — 

, r^rrr::- 

, Z factors (36 37) Displacement of bFGF from its storage within 

".ion of neovascularization in normal and pa.holog.ca, s.tua »a, 
Reeent sludies indica,e .ha. heparin and HS are mvolved n binding 
„ of bFGF Thigh affinity cell surface recep.ors and in bFGF cel. signaling 
.38 39) Moreover, .he size of HS required for optimal effect was similar 
* at of HS fragments re.eased by heparanase (40) ■ ™ 
obtained with vascular endothelial cells growth fac.or (VEGF) (41) 
sulgesting .he operation of a dual receptor mechanism involving HS m 
25 cefl fraction with heparin-binding growth factors, h , merefore 
M p oposed .hat restriction of endothelial eel. growth factors m ECM 
Lvents fteir systemic action on the vascular endothelium, thus 
mining a very low rate of endothelial cells turnover and vessel 
Z2 O" *e oTher hand, release of bFGF from storage in ECM «. 
M elpiex with HS fragment, may elici. localized ^dnthehal el. 
proTferation and neovascularizafion in processes such as wound heahng, 
inflammation and tumor development (36,37). 

The involvement of heparanase in o,Uer p„ysio,o g ,cal presses 
and its potential therapeutic applications „„„,:„„ 
Apart from Us involvemen. in .umor cell meU.s,as,s .nflammaflon 
and autimmun><y, mammalian heparanase may be apphed «o modula e 
"aWUby of heparin-binding growth fac.ors; cellular responses 
h parin-bindTng grow* fac.ors (e.g., bFGF, VEGF) and cytokines (1L-8) 
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(44 41)- cell interaction with plasma lipoproteins (49); cellular 
susceptibility to certain viral and some bacterial and protozoa infections 
(45-47); and disintegration of amyloid plaques (48). 

Viral Infection: The presence of heparan sulfate on cell surfaces 
have been shown to be the principal requirement for the binding of Herpes 
Simplex (45) and Dengue (46) viruses to cells and for subsequent mfecuon 
onhe cells Removal of the cell surface heparan sulfate by heparanase 
m ay therefore abolish virus infection. In fact, treatment of cells with 
bacterial heparitinase (degrading heparan sulfate) or hepannase (degrading 
heparan) reduced the binding of two related animal herpes cruses to ce Is 
and rendered the cells at least partially resistant to virus infection (45> 
There are some indications that the cell surface heparan sulfate is also 
involved in HIV infection (47). 

Neurodegenerative diseases: Heparan sulfate proteoglycans were 
identified in the prion protein amyloid plaques of Genstmann-Straussler 
Syndrome, Creutzfeldt-Jakob disease and Scrape (48). Heparanase may 
disintegrate these amyloid plaques which are also thought to play a role m 
the pathogenesis of Alzheimer's disease. 

Restenosis and Atherosclerosis: Proliferation of arterial smooth 
, muscle cells (SMCs) in response to endothelial injury and accumulation of 
cholesterol rich lipoproteins are basic events in the pathogeneses of 
atherosclerosis and restenosis (50). Apart from Us involvement in _SMC 
proliferation as a low affinity receptor for heparin-binding growth factor 
HS is also involved in lipoprotein binding, retention and uptake (51). 1 
, 5 was demonstrated that HSPG and lipoprotein lipase participate in a nove 
catabolic pathway that may allow substantial cellular and interna 
accumulation of cholesterol rich lipoproteins (49). The latter pathway* 
expected to be highly atherogenic by promoting accumulation of apoB and 
apoE rich lipoproteins (e.g., LDL, VLDL, chylomicrons), dependent of 
30 feed back inhibition by the cellular cholesterol content. Removal of SMC 
HS by heparanase is therefore expected to inhibit both SMC proliferation 
and lipid accumulation and thus may halt the progression of restenos.s and 
atherosclerosis. 

Pulmonary diseases: 
35 The data obtained from the literature suggests a possible role for 

GAGs degrading enzymes, such as, but not limited to, heparanases 
connective tissue activating peptide, heparinases, hyluronidases, sulfatases 
and chondroitinases, in reducing the viscosity of sinuses and airway 
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secretions with associated implications on curtat.mg the rate of m cuon 
and inflammation. The sputum from CF patients eontams properties 
QAGs thus contributing to its volume and v.scous property 
Recombinant heparanase has been shown to reduce viscosity of sputum of 
CF natients (see U.S. Pat. application No. 09/046,475). 

^ntimarv, heparanase may thus prove useful for conditions such 
as wound healing, angiogenesis, restenosis, ^^''"^Z 
neurodegenerative diseases and viral infections. Mammahan heparanase 
„ be used to neutralize plasma heparin, as a potential replacement of 
protamine. Anti-heparanase antibodies may be apphed for 
•detection and diagnosis of micrometastases, autoimmune les.ons 
and renal failure in biopsy specimens, plasma samples, and body tafc 

There is thus a widely recognized need for, and u would be tughly 
advantageous to have, additional molecules with glycosyl hydrolase 
activity because such molecules may exhibit greater spectfic activtty 
^certain substrates or different substrate specificity than the known 
heparanase. 

^^^^vrJTHF INVENTION 

SUMb 7^-^^^re present invention there .s provtded 
an isolated nucleic acid comprising a pol^lcotide f 
SEQ ID NOs : l, 4, 6 or portions thereof at 68 °C ,n 6 x SSC 1 A SDS, 5 x 
Denharts, 10 % dextran sulfate, 100 pg/ml salmon sperm DNA, and 3Z p 
labeled probe and wash at 68 °C with 3 x SSC and 0.1 % SDS. 

According to another aspect of the present tnventton . then s 
" provided an isolated nnc.eic acid comprising a polynucleotide hybrtdtzaMe 
with SEQ ID NOs:l, 4, 6 or portions thereof at 68 °C m 6 x SSC 1 A 
SDS, 5 x Denharts, 10 % dextran sulfate, 100 pg/ml salmon sperm DNA, 
and 32„ , ab eled probe and wash at 68 °C with 1 x SSC and 0.1 % SDS. 

According to still another aspect of the present invention there .s 
provided an isolated nucleic acid comprising a polynucleotide hybndtzabfc 
with SEQ ID NOs:l, 4, 6 or portions thereof at 68 °C m 6 x SSC 1 A 
SDS, 5 x Denharts, 10 % dextran sulfate, 100 pg/ml salmon sperm DNA, 
and 32p tabeled probe and wash at 68 °C with 0.1 x SSC and 0.1 /. SDS. 

According to yet another aspect of the present mvention dtere » 
provided an isolated nucleic acid comprising a polynucleotide at leas. 60 
% identical with SEQ ID NOs:l, 4, 6 or portions thereof as determmed 
using the Bcstfit procedure of the DNA sequence analysts software 
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package developed by the Genetic ' Computer Group (GCG) at the 
university of Wisconsin (gap creation penalty - 50, gap extension penalty - 

3) ' According to still another aspect of the present invention there is 
provided an isolated nucleic acid comprising a V^^^ff *** 
polypeptide being at least 60 % homologous with SEQ ID NOs: 5 . Tor 
portions thereof as determined using the Bestfit procedure of the DNA 
sequence analysis software package developed by the Genetic Computer 
Group (GCG) at the university of Wisconsin (gap creation penalty - 50, 
gap extension penalty - 3). 

According to further features in preferred embodiments of the 
invention described below, the polynucleotide is as set forth in SEQ ID 
NOs l 4, 6 or portions thereof. 

' According to an additional aspect of the present invention there is 
provided a recombinant protein comprising a polypeptide encoded by the 
polynucleotides herein described. 

According to yet an additional aspect of the present invention there 
is provided a recombinant protein comprising a polypeptide at least 60 % 
homologous with SEQ ID NOs:3, 5, 7 or portions thereof as determined 
3 using the Bestfit procedure of the DNA sequence analysis ^ software 
package developed by the Genetic Computer Group (GCG) at the 
university of Wisconsin (gap creation penalty - 50, gap extension penalty - 

3) ' According to further features in preferred embodiments of the 
:5 invention described below, the polypeptide is as set fourth in SEQ ID 
NOs:3, 5, 7 or portions thereof. 

According to still an additional aspect of the present invention there 
is provided a nucleic acid construct comprising the isolated nucleic acid 
herein described. . 
30 According to a further aspect of the present invention there is 

provided a nucleic acid construct comprising a polynucleotide encoding 
the recombinant protein herein described. 

According to still a further aspect of the present invention there is 
provided a host cell comprising a polynucleotide or construct and/or 
35 expressing a recombinant protein as herein described. 

According to yet a further aspect of the present invention there is 
provided an antisense oligonucleotide or nucleic acid construct comprising 
a polynucleotide or a polynucleotide analog of at least 10 bases being 
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hybridizable in vivo, under physiological conditions, with (i) a portion of 
a ^nucleotide strand encoding a polypeptide at least 60 % homologou 
with SEQ ID NOs:3, 5, 7 or portions thereof as dctermmed usmg the 
Bestfit procedure of the DNA sequence analysis software package 
developed by the Genetic Conner Group (GCG) a, the untversrty of 
Wisconsin (gap creation penalty - 50, gap ex.ens.on penal* - 3); or (.. a 
portion of a polynucleotide strand a. least 60 % .denhcal w,.h SEQ ID 
NOs 1 4 6 or portions thereof as determined using the Bestfit procedure 
of fire DMA sequence analysis software package developed by the Gene,,c 
Computer Group (GCG) a, the university of Wisconsin (gap creauon 
nenalty - 50, gap extension penalty - 3). 

According to another aspect of the present invents there is 
provided a ribozymc comprising the antisense oligonucleotide herein 
described and a ribozyme sequence. 

The present invention provides polynucleotides and polypeptides 
belonging to a class of asp-glu glycosyl hydrolases of the GH-A clan, 
probably, based on homology to heparanase, GAG degrading enzymes. 

rrtpf DESCRIPTION OF THF DRAWINGS 

3 " ^^^^^ 

reference to the accompanying drawings, wherein: 

FIG 1 shows the nucleotide sequence (SEQ ID NOs:l-2) and the 
deduced amino acid sequence (SEQ ID NOs:2-3) of hnhpl' 

FIG 2 is a comparison of the deduced amino acid sequences ot 
, ^p;(SEQIDNOs:2-3)andofheparanase(SEQlDNO:9) Comparison 
was performed using the Gap program of the GCG package (gap creation 
oenalty - 50, gap extension penalty - 3); 

FIG 3 illustrates variability of hnhpl transcripts. Hnhpl was 
amplified from placenta and from testis marathon ready cDNA libranes 
30 using the gene specific primers pn9-312u (SEQ ID NO:14, and hnll-230 

(SEQlDNO:ll); . 

FIG 4 shows a zoo blot. Ten micrograms of genomic DNA from 
various species were digested with EcoK and separated on U%q» 
- TBE gel Following electrophoresis, the gel was treated w.th HC1 and 
M then with NaOH and the DNA fragments were downward transtoed to a 
nylon membrane (Hybond N + , Amcrsham) with 0.4 N NaOH. The 
membrane was hybridized with a 1.7 Kb DNA probe Jhat — d "te 
hnhpl cDNA (clone pn9). Lane order: H - Human; M - Mouse; Rt - Rat, P 



WO 01/00643 



PCT/1L0O/O0358 



_ Pig- Cw - Cow; Hr - Horse; S - Sheep; Rb - Rabbit; D - Dog; Ch - 
Chicken; F - Fish. Size markers (Lambda Brtdl) are shown on the left; 

FIG 5 illustrates cross hybridization between hpa and hnhpl. Hpa 
was amplified by PGR from marathon ready placenta cDN A hbrary. 
Hnhpl was amplified from testis marathon ready cDNA hbrary. PCR 
products were run on agarose gel in duplicates and transferred to a nylon 
membrane. One membrane was probed with 32 p labeled hpa cDNA and 
the other with hnhpl, clone pn9. 

FIG 6 is a comparison of the hydropathic profiles of heparanase 
and hnhpl. The curves were calculated according to the Kyte and Duhttle 
method over a window of 17 amino acids. 

FIG 7 shows a Western blot analysis of recombinant hnhpl 
expressed in human embryonal kidney 293 cells. A - control heparanase- 
FLAG precursor, B-D - 293 cells trasfectcd with a control P SI vector (B), 
P SI- P n6 (C) and P SI-pn9 (D). Cell extracts were separated by SDS- 
PAGE transferred onto Immobilon-P nylon membrane (Milhpore . 
Membrane was incubated with anti-FLAG Flag antibody 1:1000 (Kodak 
anti FlagM2 cat: IB 13025). 

, nF SPRIPTION HF THF. PREFFPPP" FMBODIMENTS 

T^present invention is of novel polynucleotides encodmg 
polypeptides distantly homologous to heparanase, nucleic acid constructs 
including the polynucleotides, genetically modified cells expressing same, 
recombinant proteins encoded thereby and which may have heparanase or 
5 other glycosyl hydrolase activity, antibodies recognizing the recombinant 
proteins, oligonucleotides and oligonucleotide analogs derived from the 
polynucleotides and ribozymes including same. 

The principles and operation of the present invention may be better 
understood with reference to the drawings and accompanying descnptions. 

Before explaining at least one embodiment of the invention in 
detail it is to be understood that the invention is not limited m its 
application to the details of construction and the arrangement of the 
components set forth in the following description or illustrated in the 
drawings The invention is capable of other embodiments or of being 
35 practiced or carried out in various ways. Also, it is to be understood that 
the phraseology and terminology employed herein is for the purpose of 
description and should not be regarded as limiting. 
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While reducing the present invention to practice the human EST 
database was screened for homologous sequences using the entire ammo 
acid sequence of human heparanase (SEQ ID NO:9). A distantly 
homologous fragment was pooled out, accession number A 222323, 
IMAGE clone number 1843155 from Soares_NFL_T_GBC_S 1 Homo 
Sapiens cDNA library prepared from testis B-cells and fetal lungs. The 
clone contained an insert of 560 bp (SEQ ID NO:23) of which the 3 
region was homologous to the human hpa gene encoding human 
heparanase. Primers derived from the newly identified clone were used to 
isolate several cDNAs including several open reading frames which reflect 
in frame alternative splicing, the longest of which, P n6, appears in Figure 
1 (SEQ ID NOs-1, 2 and 3) is 2060 nucleotide long and it contains an open 
reading frame of'l776 nucleotides, which encodes a polypeptide of 592 
amino acids, with a calculated molecular weight of 66.5 kDa. The newly 
cloned gene was designated hnhpl. Two shorter forms, P n9 and pn5 and 
their deduced amino acid sequences are set forth in SEQ ID NOs:4 and 6 
and SEQ ID NO:5 and 7, respectively, and are further described in the 
Examples section that follows. Comparison between the amino acid 
sequence of hnhpl and heparanase is shown in Figure 3. The homology 
, between the two proteins is 52.8 % or 55.3 %, depending on the software 
employed. No cross hybridzation was detected between hpa and hnhpl, 
even under very moderate wash conditions (Figure 5). Zoo blot analysis 
demonstrated that the hnhpl gene and other related genes, perhaps forming 
a new gene familly, are present in genomes of other organisms including 
.5 mammals and avians. The chromosome localization of hnhpl was 
determined using G3 radiation hybrid panel to be on human chromosome 
10 next to the marker SHGC-57721. The results also indicated a 
possibility of a second copy of the gene or of a related gene. The hnhpl 
gene is expressed in low levels in lymph nodes, spleen, colon and ovary; in 
30 slightly higher levels in prostate and small intestine; and in yet more 
pronouced level in testis. No expression was detected under the assay 
employed in bone marrow, liver, thymus, tonsil or leukocytes. Screening 
of the mouse EST database with the amino acid sequence of heparanase as 
well as of hnhpl pooled out a mouse EST clone (clone 1378452 accession 
35 number AI019269 from mouse thymus, SEQ ID NO:8). However, this 
clone includes two frame shift mutations which hamper its open reading 



frame. 
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The overall homology be.ween the amino aeid sequence of hnhpl 
and heparanase suggest that these two proteins share similar function. The 
homo ogy betweef the two proteins is concentrated a. several region . 
C may represent functional domains of the protein. The variability 
Tay suggest potential difference in substrate recognition, cellular 
1 realization and parameters of activity . 

D^pite I lac, of an overall homology between U». heparanase 
and other glycosyl hydrolases, the amino acid couple asp-glu (NE, SEQ ID 
NO- 3) whL is characteristic of the proton donor of glycosyl hydrops 
of the GH-A clan, was found at positions 224, 225 of heparanase. As m 
o her cfan members, this NE couple is located a. the end of a p strand As 
Iwn in Figure 2, the region surrounding the NE couple is conserved m 
*e plicted amino acid sequence of HnHpL This suggests ft* **/ 
product is a glycosyl hydrolase. This definition may include any 
pX—de degrading enzyme, either exo or endo and 
based on the similarity to heparanase i, is likely that ,« encodes a GAG 

superposition of the hydropathic profiles of 
heparanase and HnHpl (Figure 6) indicates an overlapping 
„ the oroteins The amino acid sequence characterise of glycosyl 
tdroCTs located within a hydrophilic peak and a, the same position » 
I aligned proteins. A remarkable difference in the hyfcopath, ^» 
noticeraround amino acids 157, 158 of heparanase, which constitu te the 
pressing site of the enzyme. While in heparanase, ttus sue ,s located a 
, 5 h fi f a hydrophilic peak, the equivalent region of hnhp. is rather no- 
hydrophilic. The peak around amino acid 1 10 of heparanase appears also 
amino acid' .30 of Cleavage of heparanase a, thrs region 

was shown to result in enzyme activation. The equivalent region ofMp' 
mieht be a potential processing site. 
30 * Heparanase has a potential signal peptide at the N-terminus of the 
67 kDa form. The homology between the two protems ,s low a. the N- 
termini and no signal peptide was identified in hnhpl 

According to one aspect of the present invention there is provided 
an isolated nucleic acid comprising a 
35 SEO ID NOs-1, 4, 6 or portions thereof at 68 °C >n 6 x SSC, 1 % SDS 5 x 
nenharts 10 % dextran sulfate, 100 pg/ml salmon sperm DNA, and p 
tabid probe and wash a. 68 -C with 3 x SSC, 1 x SSC or 0, * SSC and 
0.1 % SDS. 
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As used herein in the specification and in the claims section that 
follows, the term "portion" or "portions" refer to a consequtive stretch of 
nucleic or amino acids. Such a portion may include, for example, a least 
90 nucleotides (equivalent to at least 30 amino acids) at least 120 
« nucleotides (equivalent to at least 40 amino acids), at least 150 nucleotides 
(equivalent to at least 50 amino acids), at least 180 nucleotides (equivalen 
to at least 60 amino acids), at least 210 nucleotides (equivalent to at least 
70 amino acids), at least 300 nucleotides (equivalent to at least 100 ammo 
acids) at least 600 nucleotides (equivalent to at least 200 amino acids), at 
o least 900 nucleotides (equivalent to at least 300 amino acids), at least 
1 200 nucleotides (equivalent to at least 400 amino acids), at least 1,500 
nucleotides (equivalent to at least 500 amino acids), or more. 

According to another aspect of the present invention there is 
provided an isolated nucleic acid comprising a polynucleotide at least 60 
15 % preferably at least 65 %, more preferably at least 70 %, still preferably 
at'least 75 %, yet preferably at least 80 %, more preferably at least 85 A 
more preferably at least 90 %, most preferably at least 95 % - 100 /„, 
identical with SEQ ID NOs:l, 4, 6 or portions thereof as determined using 
the Bestfit procedure of the DNA sequence analysis software package 
20 developed by the Genetic Computer Group (GCG) at the university of 
Wisconsin (gap creation penalty - 50, gap extension penalty - 3). 

According to still another aspect of the present invention there is 
provided an isolated nucleic acid comprising a polynucleotide encoding a 
polypeptide being at least 60 %, preferably at least 65 %, more preferably 
25 at least 70 %, still preferably at least 75 %, yet preferably at least 80 /o, 
more preferably at least 85 %, more preferably at least 90 %, most 
preferably at least 95 % - 100 %, homologous with SEQ ID NOs:3, 5, 7 or 
portions thereof as determined using the Bestfit procedure of the DNA 
sequence analysis software package developed by the Genetic Computer 
30 Group (GCG) at the university of Wisconsin (gap creation penalty - 50, 
gap extension penalty - 3). 

As used herein in the specification and in the claims section that 
follows, the term "homologous" refers to identical + similar. 

According to an additional aspect of the present invention there is 
35 provided a recombinant protein comprising a polypeptide encoded by the 
polynucleotides herein described. 



WO 01/00643 



PCT/ILOO/00358 



The necleic acid according to the present invention can be a 
complementary polynucleotide sequence, genomic polynucleotide 
sequence or a composite polynucleotide sequence. 

As used herein the phrase "complementary polynucleotide 
sequence" includes sequences which originally result from reverse 
transcription of messenger RNA using a reverse transcriptase or any other 
RNA dependent DNA polymerase. Such sequences can be subsequently 
amplified in vivo or in vitro using a DNA dependent DNA polymerase. ^ 
As used herein the phrase "genomic polynucleotide sequence 
includes sequences which originally derive from a chromosome and reflect 
a contiguous portion of a chromosome. 

As used herein the phrase "composite polynucleotide sequence" 
includes sequences which are at least partially complementary and at least 
partially genomic. A composite sequence can include some exonal 
sequences required to encode a polypeptide, as well as some intronic 
sequences interposing therebetween. The intronic sequences can be of any 
source, including of other genes, and typically will include conserved 
splicing signal sequences. Such intronic sequences may further include cis 
acting expression regulatory elements. 
, Thus this aspect of the present invention encompasses (i) 

polynucleotides as set forth in SEQ ID NOs:l, 4 and 6; (ii) fragments or 
portions thereof; (Hi) sequences hybridizable therewith; (iv) sequences 
homologous thereto; (v) genomic and composite sequences coresponding 
thereto- (vi) sequences encoding similar polypeptides with different codon 
5 usage; and (vii) altered sequences characterized by mutations, such as 
deletion, insertion or substitution of one or more nucleotides, either 
naturally occurring or man induced, either randomly or in a targeted 
fashion. 

According to yet an additional aspect of the present invention there 
,o is provided a recombinant protein comprising a polypeptide at least 60 %, 
preferably at least 65 %, more preferably at least 70 %, still preferably at 
least 75 % yet preferably at least 80 %, more preferably at least 85 %, 
more preferably at least 90 %, most preferably at least 95 % - 100 % 
homologous with SEQ ID NOs:3, 5, 7 or portions thereof, as determined 
35 using the Bestfit procedure of the DNA sequence analysis software 
package developed by the Genetic Computer Group (GCG) at the 
university of Wisconsin (gap creation penalty - 50, gap extension penalty - 
3). 
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According to still an additional aspect of the present invention there 
is provided a nucleic acid construct comprising the isolated nucleic acid 
herein described. 

According to a preferred embodiment of the present invention the 
nucleic acid construct further comprising a promoter for regulating the 
expression of the isolated nucleic acid in a sense or antisense orientation. 
Such promoters are known to be exacting sequence elements required for 
transcription as they serve to bind DNA dependent RNA polymerase 
which transcribes sequences present downstream thereof. Such down 
stream sequences can be in either one of two possible orientations to result 
in the transcription of sense RNA which is translatable by the nbozyme 
machinery or antisense RNA which typically does not contain translatable 
sequences, yet can duplex or triplex with endogenous sequences either 
mRNA or chromosomal DNA and hamper gene expression, all as further 
detailed hereinunder. 

While the isolated nucleic acid described herein is an essential 
element of the invention, it is modular and can be used in different 
contexts. The promoter of choice that is used in conjunction with this 
invention is of secondary importance, and will comprise any suitable 
, promoter. It will be appreciated by one skilled in the art, however that it 
is necessary to make sure that the transcription start site(s) will be located 
upstream of an open reading frame. In a preferred embodiment of the 
present invention, the promoter that is selected comprises an element that 
is active in the particular host cells of interest. These elements may be 
5 selected from transcriptional regulators that activate the transcription of 
genes essential for the survival of these cells in conditions of stress or 
starvation, including, but not limited to, the heat shock proteins 

A construct according to the present invention preferably further 
includes an appropriate selectable marker. In a more preferred 
30 embodiment according to the present invention the construct further 
includes an origin of replication. In another most preferred embodiment 
according to the present invention the construct is a shuttle vector, which 
can propagate both in E. coli (wherein the construct comprises an 
appropriate selectable marker and origin of replication) and be compatible 
35 for propagation in cells, or integration in the genome, of an organism of 
choice The construct according to this aspect of the present invention can 
be, for example, a plasmid, a bacmid, a phagemid, a cosmid, a phage, a 
virus or an artificial chromosome. 
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Alternatively, the nucleic add construe, aeeording to this aspect of 
to pres nt invention further inc.udes a positive and a negat.ve se ect.on 
marked and may therefore be employed for se.ec.ing for homologous 
retina ion events, including, but not limited to, homologous 
ruination employed in knock-in and Knock-out procedure * On 
ordinarily skilled in the art can readily des.gn a k»oek-om or knock » 
constructs including bom positive and negafve se, c on g nes o 
efficiently selecting transfeeted embryonic stem cells that underwent a 
hfZgous recombination even, with the construct. Such cells can e 
introduced into developing embryos to generate ch.meras, the offsprmg 
SS - be tested for carrying the knock-out or k„ock-,n construe^ 
K oXou, and/or knock-in construct according to the present mvenfon 
2 Z used ,0 mrther investigate the functionality of the new gcn.Such 
constructs can also be used in somatic and/or germ cells gene thempy to 
destroy activity of a defective, gain of function allele or to replace the lack 
o"* of asilen, al.ele in an organism, thereby to down or upregu.** 
tZZX required. Further detni. relating to the —.on and use of 
taock ou. and knock-in construct, can be found in Fukush.ge, S. and 
*eda J E • Trapping of mammalian promoters by Cre-.ox sne-specfic 

Copeland, N.G, Mouse models of human dtseaae. Par. 1: Tectaques and 
resources for genetic analysis in mice. Genes -d Deve.opmen. U 0997) 
I 11- Bermingham, J.J., Scherer, S.S., O'Connell, S., Arroyo, E. Kalla, 
K A Powlh F.L. and Rosenfe.d, M.G, Ts.-l/Oct-6/SCIP regulates a 
,5 Lte Z in peripheral myelination and is required for norma! 
IpiLol Genes Dev .0 (.996) .75.-62, which are incorporated herem 

"""'"According to ye, another aspect of the present invention there is 
provided a host cell or animal comprising a nucleic acid construe, or a 
,„ portion .hereof as described herein. Methods of transferring host ce . 
both prokaryo.es and eukaryo.es, and organisms w,.h nucle. ac.d 
construes and selection of transfomtan.s (e.g Uturform d perils or 
.ransgenic animals) are well known to those of sk.lls ,n the an. In 
additL, once transfeeted, such cells and organisms ; can be des gned o 
35 direct the production of ample amounts of a recombmant pro e,„ wh ch 
2 then be purf.ed by known methods, including, bu, no. hrnUed .o 
vlus chrorLgraphy and gel elcCrophoresis method. Such a purged 
recombinant protein can serve for clicita.ion of anfbod.cs as further 
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protein purification are described in detail ,n reference 52, both are 
in ^~:^ t mIX aspect or tbe present invention tbere is 
provided an ongonncieotide of a, .east 17, at .east .8, a, leas, .9, a lea 
20 a. leas. 22, at least 25, a, leas, 30 or a, least 40, bases specially 
hv'hridiyable with the isolated nucleic acid described herem. 
hybnd H^ion of shorter nucleic acids (below 200 bp in leng*,^ 
, n-40 bp in length) is effected by stringent, modern, or mtld 

U-n, — — ^™3m",0^M 
hvbridizauon so lut.on of 6 x SSC £1 ^ % ^ ^ ^ 

SDS at 1 - 1-5 C below th m, m q q1 m 

hybridization solution of 6 x SSC ana u.i /o 

™sh t 22 4 whereas mild hybridization is effected by a hybndrzahon 
Zf fix SSC and 1 % SDS or 3 M TMACI, 0.01 M sodmm 
K phosphate (pH 6-8) 1 EDTA (pH 7.5), 0.5 % SDS, .00 ^m. 
phosphate tpH o. hybridization 

,„ ° C ' According to an additional aspect of the present invention there is 
orovideTa pa " of oligonucleotides each independently of a, .east ,7, at 
* Zt 19, a. least 20, at leas. 22, a. least 25, a, least 30 or a 
leas 4<i bases specifically hybridizable with the isolated nuc.eic a ad 
des ribed herein in an opposite orientation so as to direct exponents 
„ amotion of a port.on thereof in a nucleic acid amplified reaction 
ita polymerL chain reaction. The poiymerase chain react,™ an 
„"her nucleic acid amplified reactions are well known m the art and 
"no further description herein. The pair of ohgonucleo.ides 
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according to this aspect of the present invention are preferably selected to 
have compatible melting temperatures (Tm), e.g., melting t»p«tu» 
which differ by less than that 7 °C, preferably less than 5 Q more 
preferably less than 4 °C, most preferably less than 3 °C .deally between 3 
> o C and zero °C. Consequently, according to yet an add.tional aspect of 
the present invention there is provided a nucleic acid amplification product 
obtained using the pair of primers described herein. Such a nucleic acid 
amplification product can be isolated by gel electrophoresis or any other 
size based separation technique. Alternatively, such a nucleic acid 
o amplification product can be isolated by affinity separation ei her 
strandness affinity or sequence affinity. In addition, once isolated, such a 
product can be further genetically manipulated by restriction, ligation and 
the like, to serve any one of a plurality of applications associated w.th up 
and/or down regulation of activity. 

According to still an additional aspect of the present invention there 
is provided an antisense oligonucleotide comprising a polynucleotide or a 
polynucleotide analog of at least 10 bases, preferably between 10 and 15 
more preferably between 50 and 20 bases, most preferably at least 17 a 
least 18, at least 19, at least 20, at least 22, at least 25, at least 30 or at 
20 least 40 bases being hybridizable in vivo, under physiological conditions 
with (i) a portion of a polynucleotide strand encoding a polypeptide at leas 
60 o/o, preferably at least 65 %, more preferably at least 70 % stil 
preferably at least 75 %, yet preferably at least 80 %, more preferably ^at 
least 85 %, more preferably at least 90 %, most preferably at least 95 A - 
25 1 00 o/o homologous to SEQ ID NOs:3, 5, 7 or portions thereof as 
determined using the as determined using the Bestfit procedure of the 
DNA sequence analysis software package developed by the Genetic 
Computer Group (GCG) at the university of Wisconsin (gap creation 
penalty - 50, gap extension penalty - 3); or (ii) a portion of a 
30 polynucleotide strand at least 60 %, preferably at least 65 /o more 
preferably at least 70 %, still preferably at least 75 %, yet preferably at 
least 80 %, more preferably at least 85 %, more preferably at least 90 /* 
most preferably at least 95 % - 100 % identical with SEQ ID NO.-.1. 4, 6 
or portions thereof as determined using the Bestfit procedure of the DNA 
35 sequence analysis software package developed by the Genetic Computer 
Group (GCG) at the university of Wisconsin (gap creation penalty - 12, 
gap extension penalty - 4). 
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Such antisense oligonucleotides^ be used << > 

« f.,nher detailed hereinunder. Such an antisense 

lis;., — - — * 
««» .«.—». »— **• <«■— ««»» *» 

"•*r,r ™~"...«. — - — — r-; 

(rTbosomes), to the target mRNA, a phenomenon known .n the art as 

pL— gists who were devoid a, deveioping the anfsense approach 
— ^SSli- eoUgo„uc.eo,ideshavebeen« 
30 arrest hTmatopoLc ceH pro.ifem.ion, growth, entry into the S phase of 
Z e« eyde reduced survival and prevent receptor mediated responses^ 
For efficient m vivo inhibition of gene expression usmg ant, en 
nucleotides or anaiogs, the oligonucleotides or ana ogs must MfiU 
the following requirements (i) sufficient spec.ficty m bmdmg to the target 
35 Z*L- 00 solubility in water; (Hi) stabile aga.ns, rntra- and 
" e"li nucleases; (iv) capability of penetration through the eel, 
membrane; and (v) when used to treat an organism, low toxrety. 
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Unmodified oligonucleotides are typically impracucal for use as 
antiscnse sequences since they have short * vivo half-hves, dunng wh, h 
hey are degraded rapidly by nucleases. Furthermore, they are d.fflcu U to 
prepare in more than milhgram quantities. In add.t.on, such 
oligonucleotides are poor cell membrane penetrate* 

Thus it is apparent that in order to meet all the above 1 sted 
requirements, oligonucleotide analogs need to be dev.sed m a su.tab e 
tine, Therefor;, an extensive search for modified ohgonuc.eot.des has 

^ "ample, problems arising in connection with double-stranded 
DNA (dsDNA, recognition through triple helix formation have been 
Wished by'a clever "switch back" "'^^hin 
sequence of polypurine on one strand is recogmzed, and by swttctang 
back" a homopurine sequence on the other strand can be recogn^. 
A Tso good helix formation has been obtained by usmg art.fica. bases^ 
teeby improving binding conditions with regard to ton.c strength and 

,„ addition, in order to improve ha.f-l.fe as we,, as membrane 
penetration, a .arge number of variations in polynueleot.de backbones 
0 have been done, nevertheless with little success. 

Oligonucleotides can be modified either in the base, the sugar « the 
phosphate moiety. These modifications include, for example, the use of 
meZphosphonates, monothiophosphates, diphosphate, 

phoShorarmda.es, phosphate esters, bridged phosphor 
„ phosphoramidates, bridged methylenephosphona.es, dephospho 
intelcleotide analogs with siloxane bridges, carbonate b.dges 
carboxymethyl ester bridges, carbonate bridges, carboxymethy. ester 
b M eTace.am.de bridges, carbama.e bridges, .hioemer bridges ^sulfoxy 
bridges, su.fono bridges, various "plastic" DNAs, a-anomer.e bndges and 

M """iSSL patent appl.ca.ion WO 89/12060 discloses various 
bui.ding blocks for synthesizing oligonucleotide analogs, as well 
oCucleotidc analogs formed by joining such build ng Hocks m a 
deLd sequence. The building blocks may be Cher ng.d (, e 
,5 chaining a ring s.ructure) or •■fiexib.e" (i.e., .acking a nng struc.ure). In 
H cases, the building b.ocks contain a hydroxy group and a mereapto 
gmup through which ,he building blocks are said .0 ,o,n to form 
o ™le.eotide analogs. The linking moiety in the ohgonucleonde 
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analogs is selected from the group consisting of sulfide (-S-), sulfoxide (- 

SO-), and sulfone (-S02-)- ,. 

International patent application WO 92/20702 describe an acychc 
oligonucleotide which includes a peptide backbone on winch any selected 
chemical nuc.eobases or analogs are stringed and serve a, codmg 
characters as they do in natural DNA or RMA. These 
known as peptide nucleic acids (PNAs), are no, only more 
Z their nafura, counterparts, bu, also bind natural DNA and BNA 50 to 
100 times more tightly than the natural nucleic ac.ds chng to each o*en 
PNA oligomer can be synthesized from the four protected monomers 
containing thymine, cytosinc, adenine and guanine by Memfield sol.d- 
phase peptide synthesis. In order to increase solubihty m water and to 
pl^ent aggregation, a lysine amide group is placed a. the C-.ermma, 
region and may be pegylated. m . 
Thus antisense technology requires pamng of messenger RN A 
with an oligonucleotide to form a double helix that inhibits uanslat.on 
The concept of an.isense-media.ed gene therapy was already unreduced m 
,978 for cancer therapy. This approach was based on certa.n genes .hat 
are crucial in cell division and growun of cancer cells. SyntheUc fragment 
, of genetic substance DNA can achieve this goal. Such molecules tad £ 
the targeted gene molecules in RNA of tumor cells, .hereby tnh.b,.,n *e 
translation of the genes and resulting in dysfunctional growth of these 
cells. Other mechanisms has also been proposed. These stra.eg.es have 
been used, with some success in treatment of cancers, as well as other 
.5 illnesses including viral and other infectious diseases. Anusense 
' 5 „ ^nucleotides ara typically synthesized in lengths of ,3-30 nucleobdes. 
The life span of oligonucleotide molecules in blood ,s rather short. Thus, 
they have to be chemically modified to prevent destruction by ub.qu.tous 
nucleases present in the body. Phosphorothioa.es are very w.dely used 
30 modif.ca.ion in anusense o.igonucleo.ide ongoing clinical i tnais .Aw 
generafion of antisense molecules cons.st of hybrtd anusense 
oligonucleotide with a centra, portion of synthetic DNA while four bases 
on each end have been modified win, 2'0-me.hyl ribose to resembte RNA. 
in preclinical studies in laboratory animals, such compounds have 
35 demonstrated greater stability to metabolism in body tissues and an 
improved safety profile when compared with the firs.-generafon 
unmodified phosphorothioate. Doscns of other nucleotide analogs have 
also been tested in antisense technology. 
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RNA oHgonuc.eo.Wes may also be used for antisense taUHAm as 
u r =..w c RNA-RNA duplex with the target, suggesting efficient 
^SSo theiHow stability RNA ohgonueleo.ides are 
uih.b, mn. However ' . vectors designed for this purpose. 

mabU r n ""^ns have validated the efficacy of 

for CMV retinitis (Pharmacotherapy News Network). 

= drugs in r : r Z^^L 

a, .he leve, of 

r;"„ dTof - - — «— - r 

Zefo e tahibu ulcription. A.tematively, they may prevent duplex 
"ingtd, therefore": transcription of genes within the tnple hehcal 

W "us, according to a further aspect of the present mventior 

provided a pharmaceutical composition compnstng the anfse e 
ofeonucleotide described herein and a pharmaceuuca ly acceptable 
S te pharmaceutical., acceptable carrier can be for example a 

35 Zsome loaded with the antisense oligonucleotide. Formu.atton for 
Z Administration may include, bu, are not limited .< , W£ 
2, m ents aels creams, suppositories, drops, liquids, sprays and powders. 
C—naf pbarmacLutic, carriers, aqueous, powder or oi.y bases, 
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thickeners and the like may be necessary or desirable. Compositions for 
„ra, administration inciude powders or grannies, suspenses or — 
in water or non-aqueons media, sachet, capsnles or table*. Thickeners 
diluents, flavorings, dispersing aids, emuisifers or bmders ^ be 
desirable. Formuiations for parentcrai administrate may mclude but are 
Tot iimited to, steriie aqueons solutions which may also contatn buffers, 
diluents and other suitable additives. 

According to still a fnrther aspect of the present tnvenuon there ts 
provided a ribozyme comprising the antisense oligonucleotide descnbed 
hln and a ribozyme sequence fused thereto. Such a nbozyme ,s readtly 
synlhesizable using solid phase oligonucleotide synthesis. 
" Ribozymes are being increasingly used for the sequence-spec fie 
inhibition of gene expression by the cleavage of mRNAs encodmg 
m„f uJL The possibilhy of designingribozymes ,o cleave ^ 
, specific target RNA has rendered them valuable tools ,n both bastc 
ZL and therapeutic applications. In the therapeutics area, nbo^me 
have been exploited to targe, viral RNAs in infecttous d.seases, dominant 
oncogenes in cancers and specific somatic mutations m »»» ***** * 
Most notably, several ribozyme gene therapy protocols for HIV patterns 
„ are already in Phase ! trials. More recently, ribozymes have been used for 
rsgenic y anima. research, gene target validation ^P»»~ 
Several ribozymes are in various stages of chntcal trta.s ANOIOZYME 
was the first chemically synthesized ribozyme to be studted m human 
clinical trials. ANOIOZYME specifically inhibits format.on of the VEGF- 
25 r (Vascular Endothelial Growth Factor receptor), a key component m the 
angiogenesis pathway. Ribozyme Pharmaceuticals, Inc., as well as othe 
fil have demonstrated the importance of anti-angiogenests therap ut, 
in animal models. HEPTAZYME, a ribozyme designed to selecttvely 
destroy Hepatitis C Virus (HCV) RNA, was found effective ,„ decreasing 
,0 Hepatitis C viral RNA in cell culture assays (Ribozyme Pharmaceuttcals, 
Incorporated - WEB home page). 

According to still another aspect of the present mventton there s 
provided an antibody comprising an immunoglobulin spectfically 
recognizing and binding a polypeptide a. leas, 60 % preferably a I as, 65 
35 % more preferably a, least 70 %, still preferably at least 75 A, yet 
pr ferably at least 80 %, more preferably a, leas, 85 %, more preferably a. 
Las, 90 %, mos, preferably a, leas, 95 % - 100 % homologous (idenucal + 
similar) ,o SEQ ID NOs:3, 5, 7 or portions thereof using as determmed 
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5 mvention the anubod P s 

The present in immunoreactive 
polyelonal antibodies or ^^^odies or fragments thereof, 
derivative of an antibody), or monoclonal antibodie ^ I g 
„ Monoeiona, antibodies or purified *^£ZZ%£ t such as 
having a. teas, a portion of an antigen bind ng reg om met g 
Fv, F(ab>,2, Fab fragments (Harlow and 8)> * meric 

or humanized antiooaies dim k P „ r ;Firation of these serum 

— :r^:~ - ine 1U d ta g, 

rllTr or sodium sulfate followed 

ion exchange chromatography, affinity or .mmunoaffimty 

=™^-^™" ; 

chains. Additional classes includes IgD, Igfc, IgA. S 
30 "^Methods for the generation and se.ec.ion of monoclonal antibodies 
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. • Animals are typically immunized with a mixture comprising a 

achieves an enhanced immune response to me tmmunogen. By^y of 
e^alle the primary immumzation convenient.y may he accomphshed 
vTa mixmre of a solution of the recombinant protein of the present 
" „ ioTatd Frennd, compiete adjuvant, said mixture bei„ 8 ; prepay m 

_ « * * * The immunization schedule ot the lmmunogen 

by any convenient method including by way of examp c 

===== 

« s en d to identify cultures containing antibodies mat recogmze h 
TptTof choice. Hybridomas that secrete antibodies • * 
i • . nrntein 0 f the present invention are cloned by limiting 

3S antibodies are purified and characterized in terms of immunogiobuhn type 
and binding affinity. 
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Additional objects, advantages, and novol features of the present 



EXAMPLES 



Referenoe is now made to the following examples, which together 
with the above descriptions, illustrate tine inventton m a non hmtttng 

^Generally the nomenclature used herein and the laboratory 
procedures * Lmbinant DNA technology described below are those 
weCl and commonly employed in the art. Standard techn.ques are 
Tsedfer cloning, DNA and RNA isolation, amplification and punficaUon. 
Gene a 1 en^atic reactions involving DNA ligase, DNA polymerase 
12 enlnucleases and the like are performed accordmg ; « the 
^ufacurers' specifications. Tnese techniques an< = s °,h - 
tp . ^ ee nerally performed according to Sambrook et ai., 

techniques a^e gene y P Manua Co , d Spring H arbor 

molecular Cloning - Harbor N Y (1989), which is incorporated 

~rov!Te P d for the convenience of the reader. All the information 
contained therein is incorporated herein by reference. 

Materials and Experimental Methods 
The following protocols and experimental details are referenced m 
the Examples that follow: 
, Primers list: 

5*-GGAGAGCAAGTCTGTGTTGATTC-3' (SEQtDNO.,0, 

IX 5'-CACTGGTAGCCATGAGTGTGAG-3" (SEOIDNO'12) 

ta ,„ 35 0 5-.TTGGTCATCCCTCCAGTCACCA-3' (SEQIDNO.12, 

2** 5'-CTTGCCTGTAGACAGAGCTGCAG-3' (SEQ,DNO:,4, 

5 I! ,-GAGCAGCCAGGTGAGCGGAAGA., (SE Q1 DNO,6, 

5 5'-TCAGATGCAAGCAGCAACTTTGGC-3' (SEQ 1DN0:I7) 

- 1 ^ACCCTGATGTGATGCTGGAG-, (SEQ,D N 0:,S, 

I 3 5'^ATCTAGGAGAGCAATGACGTTC-3' <SEQ,D*0:,„ 



PCT/1L00/00358 



5'.CCATCCTAATACGACTCACTATAGGGC-3' 
Z 5'-ACTCACTATAGGGCTCGAGCGGC-3' ,seq,dso,o 

S DnTwL exacted fro. animal or from human blood 

labeled probe. Pn9, a 1.7 kb fragmen (de l ; 473-634, SEQ 

reading frame except for a ~^~^ on , the me mbra„e 

10 N ° ; b r^fx SSC' % at ol °C and exposed to X-ray fi.m 
was washed with 3 x bbL., u.i . ,, sc „ , Vo SDS> a t 

for 3 days. Membranes were then washed with 0. 1 x SSC, U.I/ 
i 68 °C and were re-exposed for 4 days. 

m^Z prepared using TRl-Reagent (Molecular research center 
lnc , manufacturer —-^1—1 

reverse transcription reaefon —•.*J*^ J (Prom ega). 
, (Oibco BRL) and OUgo (dT),5 <SEQ ^ N ^ ^ Ta , 

Mannheim). 

25 Cn"— nf W cre performed with vector specific and 

1 nrimers using an automated DNA sequencer (Appl.ed 
gene specfic pnmers ^ ^ ftom a( ^ , wo 

penalty - 50, gap extension penalty - 3). 

quantitative PGR. cDNA panels were ^^^^^ 
performed with the gene specific pnmers hnlu350 (SEQ ^HOffl 
taU116 (SEQ ID NO:10). PCR program was as follows. U 
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ssssssssssKasa 

35 and 40 cycles. 

Chromosome localization: oerfo nned using the 

Chromosome loc— of hnhp » pe* ^ 

human genome center at the w fic ^ 

fragment of hnhpl gene was amp 1 Hed ustng t 8 " 
hnlu350 (SEQ ID NO:12) and 1.11116 (SEQ ID NaiO P 

<• u™«- 04 °C 3 minutes, followed by 39 cycles oi > 
ronl " c, T nlte, 72 °C, 1 min. Analysis of results was done 
tooth the RH server a, the Stanford human genome center. 

EXAMPLE 1 
Cloning an EST for a novel heparanase gene 

^crs^—rArTu^meB^ 2.0 server a, .e 

NCBI, basic BLAST search, tblastn program. n 
. A distantly homologo . fragment wa^poo, ^ 

^^C,l Homo Sapiens ^J^J^JZ 

follows. Score 4 rf ^ „ umber AI222 323 

B onhe 3 q of Cone 1S43155 (complementary to 

nucleotides 165-543 of SEQ ID NOW ^ , ( 

™ S Cl0 " e T,::^ D NO 2 rThe entire nucleotide 
contained an insert of 560 bp (SEQ ID Nt^ 1 

- — ■ - r~r b -:i:. 8 T;,55and^cDNA 

human heparanase. The homology ^ J9 % 

was restricted to the 3' ^Z ^lTl Le ,843155 (SEQ ID 
homology between nucleo The deduced amino acid 
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, 98 (SEO ID NO:23) was unrelated to heparanase. This unrelaeo 
1-98 (bfcA^ inw.z-j; jiffi-r-nt rDNA clone from the 

assumed to he a chimera, which contains two 
unrelated partial cDNAs ligated to a single veelor. 

EXAMPLE 2 
Cloning a cDNA for a novel heparanase gene 

, . .„ L,; rr cDNA three primers were designed 
l0 order to isolate * enure DNA ft, P ^ 

C on^ p! AUo, California) according to the manufacturer 
ends, a ° nK ?;^ c £ ; was performc(J with the gene specific pnmer 
instructions. The lirst cycie w« y NO:20). 

5 The second cycle was performed with the gen p v 

(SEQ ID NO:ll) and the universal pnmer Ap2 (SEQ ID « J 

easy (Promega, Mad.son, WI) determ med. The consensus 

» independent clones P n5, pn6 and pn9 were detenu 

,„ NOT which correspond to amino acids 150-203 of ot-M 

result pn9 encodes a putative polypeptide of 53S amino acids (SEQ 

^°»*™^ 4 ^1SL. it encodes a putative 
- =-^»l=o^ 
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for translate start s.te. T ™' al '° n ^ adds wi , h calal , a ,ed 

molecular ' °J of the to,* cDNA using two 

,ength was confirmed by PGR ampl ^ ^ closc 

g ene specfic ^» which overlaps the stop codon 

t: Ma^on ready cDNA ^■ttt'i'lS S 
PCR products are shown in figure 3. Four banns we 

deletions cna cons titute a protein family. The 

" ^ZSZJX seances, the homo.ogy ~ the two 
X. * *" % ««"* " %Sim :' ali,y ' T* h T'° The BeftTm 
* -STS £ - "h^ ~the two 

ot Z, - *- - « % - ea,e„.a.ed by the BestFU progrant. Th 
3„ homologous region is located ^"T T^lZw Using 
(SEO ID NO: 1) and nucleotides 564-1708 of hpa (SEQ ID N0.24). us g 
Hap Program the homology is 51 % over the enfre se q ue„ce gap 
creation penalty - 50, gap extension penalty - 3. 

EXAMPLE 3 
J Zoo blot 

Hnhpl cDNA was used as a probe to detect homologous sequences 
in buman DN A and in DNA of various animals. The autorad.ogram of the 
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Southern analysis is presented in Figure 4. Several bands were detected* 
human DNA. Several intense bands were detected in all mammals while 
faint bands were detected in chicken. This correlates w lt h the 
phylogenetic relation between human and the tested animals. The intense 
bands indicate that hnhpl is conserved among mammals as well as 
genetically distant organisms. The multiple bands patterns suggest that in 
all animals, hnhpl locus occupies a large genomic region. Several specific 
bands disappeared after stringent wash. These may represent homologous 
sequences and suggest the existence of a gene family, which can be 
isolated based on their homology to the human hnhpl reported here. 

EXAMPLE 4 
comparison to heparanase via cross hybridization 

In order to check the capability of hpa and hnhpl to cross 
hybridize under low stringency conditions, the entire coding reg.on of the 
human hpa and hnhpl were amplified by PGR. Human hpa was amphfied 
from platelets mRNA by RT-PCR using the primers hpu-685 (SEQ ID 
NO:16) and h P 1967 (SEQ ID NO:17), and hnhpl was amplified from testis 
using the primers hnll230 (SEQ ID NO:ll) and pn9-312u (SEQ ID 
, NO-14) The products were quantified and samples of 100 pg and 1 ng 
were run on agarose gel and subjected to Southern hybridization. The 
membranes were probed with labeled hpa cDNA and with hnhpl 
cDNA No cross hybridization was observed (Figure 5) even after over 
exposure for 5 days. Since hpa is the most similar sequence known today 
, 5 to that of hnhpl, this experiment indicates that the bands detected in the 
autoradiograph of Figure 4 are of the hnhpl gene or of yet unknown 
sequences homologous thereto, which might constitute a gene family 
This further indicated that such sequences are isolatable using the hnhpl 
as a probe to screen the relevant libraries, or using hnhpl denved PCR 
30 primers to amplify the relevant cDNA or DNA sequences. 

EXAMPLE 5 
Chromosome localization 

The chromosome localization of hnhpl was determined using G3 
35 radiation hybrid panel. Hnhpl was amplified from 83 human/mouse 
radiation hybrids. The results were analyzed by the RH server and the 
hnhpl gene was mapped to chromosome 10, next to the marker SHGC- 
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57721. The results also indicated a possibility of a second copy of the 
gene. 

EXAMPLE 6 
Expression Pattern ofhnhpl 

The tissue distribution ofhnhpl transcripts was determined using 
calibrated human cDNA panels (Clontech, Palo Alto, Ca). The results are 
shown in Table 1 below. Expression level is generally low. PGR products 
were clearly observed only after 40 cycles of amplification. 

TABLE 1 

Tissue hnJJ40cydes} 

Bone marrow 

Liver 

Lymph node + 
Leukocytes 

Spleen + 

Thymus 

Tonsil 

0 Colon + 

Ovary + 
Prostate ++ 
Small intestine ++ 
Testis +++ 

EXAMPLE 7 
cloning of a Mouse homologue 
Screening of the mouse EST database with the amino acid sequence 
of heparanase as well as ofhnhpl pooled out a mouse EST clone, which 
so shares distant homology with heparanase and a remarkab ly & 

with hnhpl. The EST clone 1378452 access^ number AI019269 ^from 
mouse thymus was 35 1 nucleotide long and it is set forth ,n SEQ ID NO A 
It has 61-63 % identity over 161 nucleotides (191-351, SEQ ID NO:8 to 
the human (SEQ ID NO:24) and mouse (SEQ ID NO:15) hpa nuc eot.de 
3s fences, and 93 % to hnhpl nucleotide sequence (SEQ ID NO:l) usmg 
the BestFit program of the GCG package. The nucleotide sequence of this 
done did not contain an open reading frame. Two frame sh.fts were 
identified in the sequence found in the EST database, as compared to the 
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This frame shifts were later confirmed by nueleolide 
HnHp, sequence. Th, ^ frame ,* rf ^ ftagme „ t 

sequence analysts of this clone as J nucleotide 

from BL6 mouse melanoma cells and de— fton 
sequence. This mouse gene is ^ were obtaine d 

of expression were ind cated as no a^— P ^ 

KS=l^S : S^ i ■ ;, -- , - ,, 

(SEQIDNO:18)andmnll563 (SEQlDNO:19). 



EXAMPLE 8 



Fxoression ofhnhpl in mammalian cells 

.r^^- - » i 3' - - - - 

reading frame. coding sequence was 

, f<u °r 45 seconds 64 °C, 45 seconds and 72 C, 2 min. 
cycles of 94 C, 45 second, <j n GEM-T-easy, and 

P n6FandpGEM-pn9E mamma ,ian expression 

and the second contained the ^^ij, 

- re T^Ba2«^ — ft-eS-endofhnl- 
fragments: ? 9 , B amHI - Noll fragment which 

digested with EcoRI-Notl. 
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The pSI-pn9 expression vector was constructed similarly, by triple 
ligation of'he following fragments: an EcoRl - Ssp, fragment, wh.ch 
,h, V end of hnhpl-pn6, excised from P Gem-T-*asy-pn9, an 

from pGem-pn6F and pSl digested with EcoR 1 - Not I. 

The resulting plasmids were transfected .nto human embryonal 
Kidney 293 cells, using the Fugene transfection reagent (Boehnnger 
TnLim>. Fortnight hours fol.owing transfection ce>,s — 

•SZ 1 - FLAG antibody 1:1000 dUution ^ 

^ TBI 3025 final concentration 10 ng/ml). Proteins 01 

Z~LZ f^a and 60 kDa were detected in cells transfected wuh 

pS n F and P SI-pn9F respectively. These proteins are s.m.lar ,„ s z to 

*ose predicteo by the calculated mo.ecular weight for the transhtt on 

IZ1 corresponding open reading frames. It is demonstrated Ota 

Lth the entire hnhp, cONA and the P n9 sphce form^ « — dly 

transcribed and translated in human 2VJ cens. 

Ca-ase the Hnhp, protein products do no. undergo major processmg 

° '""ugh the invention has been described in -Jun« 
specific embodiments thereof, it is evident mat many »v«, 
modifications and variations will be apparent to those skdled m the art. 
A^~i« is intended ,0 embrace all such alternatives, mod.ficat.ons 

25 "afiL that fall within me spirit and broad scope o the : appended 
claims All publications cited herein are incorporated by reference m then 



entirety. 
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WHAT IS CLAIMED IS: 

1 An isolated nucleic acid comprising a polynucleotide 
hybridizable with SEQ ID NOs:l, 4, 6 or portions thereof at 68 °C m 6 x 
SSC 1 % SDS, 5 x Denharts, 10 % dextran sulfate, 100 ug/ml salmon 
sperm DNA, and 32 p labeled probe and wash at 68 °C with 3 x SSC and 
0.1 %SDS. 

2 An isolated nucleic acid comprising a polynucleotide at least 
60 o/o identical with SEQ ID NOs:l, 4, 6 or portions thereof as deterrmned 
using the Bestfit procedure of the DNA sequence analysis > softw re 
package developed by the Genetic Computer Group (GCG) at the 
university of Wisconsin (gap creation penalty - 50, gap extensmn penalty - 
3). 

3 The isolated nucleic acid of claim 2, wherein said 
polynucleotide is as set forth in SEQ ID NOs:l, 4, 6 or portions thereof. 

4 An isolated nucleic acid comprising a polynucleotide 
encoding a polypeptide being at least 60 % homologous with SEQ ID 
NOs-3 5 7 or portions thereof as determined using the Bestfit procedure 
of the DNA sequence analysis software package developed by the GeneUc 
Computer Group (GCG) at the university of Wisconsin (gap creat.on 
penalty - 50, gap extension penalty - 3). 

5. A recombinant protein comprising a polypeptide encoded by 
the polynucleotide of claim 1. 

6. A recombinant protein comprising a polypeptide encoded by 
the polynucleotide of claim 2. 

7. A recombinant protein comprising a polypeptide encoded by 
the polynucleotide of claim 3. 

8 . A recombinant protein comprising a polypeptide encoded by 
the polynucleotide of claim 4. 
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9 A recombinant protein comprising a polypeptide at least 60 
% homologous with SEQ ID NOs:3, 5, 7 or portions thereof as determined 
using the Bestfit procedure of the DNA sequence analysis software 
package developed by the Genetic Computer Group (GCG) at the 
university of Wisconsin (gap creation penalty - 50, gap extension penalty - 
3). 

10 The recombinant protein of claim 9, wherein said 
polypeptide is as set fourth in SEQ ID NOs:3, 5, 7 or portions thereof. 

11. A nucleic acid construct comprising the isolated nucleic acid 
of claim 1 . 

12. A nucleic acid construct comprising the isolated nucleic acid 
of claim 2. 

13. A nucleic acid construct comprising the isolated nucleic acid 
of claim 3. 

14. A nucleic acid construct comprising the isolated nucleic acid 
of claim 4. 

15. A host cell comprising the nucleic acid construct of claim 

11. 

16. A host cell comprising the nucleic acid construct of claim 

12. 

17. A host cell comprising the nucleic acid construct of claim 

13. 

18. A host cell comprising the nucleic acid construct of claim 

14. 

19 An antisense oligonucleotide comprising a polynucleotide or 
a polynucleotide analog of at least 10 bases being hybridizable in vivo, 
under physiological conditions, with: 
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20. 



a portion of a polynucleotide strand encoding a polypept.de 
at least 60 % homologous with SEQ ID NOs:3, 5 7 or 
portions thereof as determined using the Bestfit proced- of 
the DNA sequence analysis software package developed by 
the Genetic Computer Group (GCG) at the umvers.ty of 
Wisconsin (gap creation penalty - 50, gap extension penalty - 

portion of a polynucleotide strand at least 60 % identical 
wfth SEQ ID NOs:l, 4, 6 or portions thereof as determined 
using the Bestfit procedure of the DNA sequence analysis 
software package developed by the Genetic Computer Group 
(GCG) at the university of Wisconsin (gap creation penalty - 
50, gap extension penalty - 3). 

A ribozyme comprising the antisense oligonucleotide of 
claim 19 and a ribozyme sequence. 

21 An antisense nucleic acid construct comprising a promoter 
seauence'and a polynucleotide sequence directing the synthesis of an 
a" RNA sequence of at least 10 bases being hybrid 1Z able „ v,vo, 
under strand en *a polypeptide 
at least 60 % homologous with SEQ ID NOs:3, 5 7 or 
portions thereof as determined using the Bestfit procedure of 
the DNA sequence analysis software package developed by 
the Genetic Computer Group (GCG) at the umversity of 
Wisconsin (gap creation penalty - 50, gap extension penalty - 

00 portion of a polynucleotide strand at least 60 % identical 
wfth SEQ ID NOs:l, 4, 6 or portions thereof as determined 
using the Bestfit procedure of the DNA sequence analysts 
software package developed by the Genetic Computer Group 
(GCG) at the university of Wisconsin (gap creation penalty - 
50, gap extension penalty - 3). 



1/8 

CGCTTAATTCTAGAAGAGGGATTGA 



atgagggtgctttgtgccttccctgaagcc^ 

LLIcLgLcJJ- 205 

ILLLl^ 325 

TTGGTGACCCTGGCCCGGGGACTTTCGCCCG^ ^ 

LLUagtJ^ " 5 

L^cIggL^ 685 

LgataLtgctctaLtg^^ " 5 
LLgUcaggaIg^ 

1165 

Llll™ 1225 

L™J r L rr ^ 1285 

L^TTxlc^ 1345 



1045 
1105 
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«*«~«^ 1585 

1645 
1105 

LcccL^^ 1765 

1825 

lllLl^ 1885 

CAAAGAGACTAAATGTCATAGCGTGATCTTAGCCTAGGTAGGCC^CATCCATCCCAAAGG 200S 
ftAAATGTAGACATCACCTGTACCTATATAAGGATAAAGGCATGTGTATAGAGCAA 
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M J,I,BSKPM.PPP"tt^ L = t,LG5LSPSaP 30 

51 pj^GLKEKTLILLDV5TKNPVBTWE^FLSWLDPSIIHD.GWLDFLSSK 99 

31 U..WD^l.U™^«« FMVTlnM,IATDP ' ,FLIL " 5SC " 
100 MjVTIJUWLSPAFLRFGGKRTDEXQFQNLWPAKSRGGPGPDYYLKHYED 149 

,3 K^iiiUi™GTKll ) rilP....DPKK E ST fE EPS™ 0S Q™ 0 124 

„. DI V M DV^K WG CKIAO«Pr^E M R E K MQ HH 1 ™S»TVS„ 199 

l25 I,' ^sjpPD^EKLBLEWPVQEQLLLMHYQKKF™ 162 

ax, skg^isweI^nepnsflkkadifingsqlgedfiqlhkllrk. stfkna. .« 

300 S^IGRPRICHVX^DG™,™^!^— 349 
262 KLYGPDVMPWKTAK^KSFLKAGGEviDSVTWHHYYLNGRTATREDFE 311 
350 KTRLLDTLSDQIREIQKVVNTYTPGKKIWLEGVVTTSW^TNNLSDSYAA 3,9 

312 hpo^cxfxsstoI^strpgUlget^smgggaplesotfaa 361 

362 i^KlGLsUiE^^FG^^FOPEPO^ESU.FK 411 
112 KLVGTKVU1ASVQGSKRB KLRVYLHCTNTDHPRYKEGDLTL 45 2 

.53 ^RWtLeUpFPFSKK^ELRPEGPEGEESKS^EHGETEKM 50 2 
5 50 VDDGTLPELKPRPLRAGRTLVIPPVTMGFEVVKNVMMjACRYR 59 2 
503 ^UUpiUssUUYSFFiliU^l. 
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1 

SEQUENCE LISTING 
GENERAL INFORMATION: Jris pecker et al 



INFORMATION : Iris pecker et al. polypeptides 

TITLE^OF T INVENTION : -LVNUCLEOTICES^LVPE^^ 

hiimrfr OF SEQUENCES: 24 

CORRESPONDENCE ADDRESS : .„ c/o AnthonyCastorina 

(R > ADDRESSEE: Soi^ Jef£etson Davis Highway, Suite 

(B) STREET: Arlin ^°" 

(D) STATE : understates of America 

(E) COUNTRY : 

l COMPUTER P READABLE FORM : megabyte, 3.5" microdisk 

' \M MEDIUM TYPE: Twlnheld* Slimnote-890TX 

(B) COMPUTER: QS vers ion 6.2, 

y s ™ 

(D) SOFTWARE: Wor<1 IO converted to an ASCI 

,R1 FILING DATE: 

C CLASSIFICATION: 

(viii) ATTORNEY / AGENT INFORMATION : Sne inbein, Sol 

In! REGISTRATION NUMBER: » .457 

c REFERENCE/ DOCKET NUMBER: 201US 

TELECOMMUNICATION INFORMATION : 972 _ 3 -6127676 

(A) TELEPHONE: 972 -3-61Z7575 

(B) TELEFAX: 

(C) TELEX: 

<* TYPE'" n«S.ie acid 

' STRANDEDNESS: double 

TOPOLOGY : linear 



(xi ) SEQUENCE DESCRIPTION : SEC | ID HO: 1 : 

cgcttaattcVagaagaggg attgaatgag ggt^tttgt 

AAGCCATGCC CTCCAGCAAC ^CCCGCCCCC CCT CCC AGGCTGG 

OTCTCTACT TGGCTCTGTT gctccatctc tcc g ^ggaaAAGA 

AGACAGGAGA CCCTTGCCTG TAGACAGAGC TGC G ACAGTCAAT 

ccctgattct acttgatgtg agcaccaaga AC£ ATGATGGCTG 

GAGAACTTCC TCTCTCTGCA GCTGGATCCG CGGGGACTTT 

GCTCGATTTC CTAAGCTCCA ^CGCTTGGT GA£ ^ CCTGCRGTT c 

CGCCCGCCTT TCTGCGCTTC ^GGGCAAAA W, <. GCCCGG ATTA 

CAGAACCTGA GGAACCCGGC ^GCCCC GG ^ GTTGC CTTAG 

CTATCTCAAA AACTATGAGG ^GACATTG^ GT TATGCTGGAG 

AT AAAC AG AA AGGCTGCAAG ^TGCCCAGC AU. TR AAGGAGCA 
CTCCAAAGGG AGAAGGCAGC £AGATGCAT CTGCT^ TCTCTRG ACA 

ATTCTCCAAT ACTTACAGTA ^"CATATT AT RTTTG CTCTA 

aactttataa ctttgctgat toctctggac tcc^ cagtt ctagtgccct 

AATGCACTGC GTCGTAATCC CAATAACT^ att TC TTGGGAAC 

GAGTCTGTTG AAGTACAGCG CCAGCAAAAA fal GGCAGTAAAT 
TGGGTAATGA GCCAAATAAC ^TCGGACCA TGO^^ TGTTGCAGCC 
GGCAGCCAGT TGGGAAAGGA "ACAI^h tGGCCCTAAT ATTGGGCGGC 
CATCCGGATT TATTCCAGAG CCAGCTTATA ™ GGTGGCAGGA 
CGAGGAAGAA TGTCATCGCC CTCCTAGATG RTTG RTGGCCGGGT 

agtacagtag atgcagttac "ggcaaca grc RCACT ctctg 

GGTCAAGGTG ATGGACTTCC TGAAAACH,^ aTACATACAC TCCAGGAAAG 

accagattag gaaaattcag aaagtggtta ataca ^acaaacaa 

AAGATTTGGC TTGAAGGTGT GGTGACCACC £ A CTTTAGGAA 
TCTATCCGAT TCCTATGCTG CAGGATTCTT A TC R TTTTTTGAC 

TGCTGGCCAA TCAGGGCATT GATGTCGTGA T CRT -j- R CCAGACTA 

CATGGATACA ATCACCTCGT GGACCAGAAT GTCTTGGCTG 
CTGGCTCTCT CTCCTCTACA ^CGCCTGAT ^ AGTGATCCGG 
TGCATGTGGC TGGGCTCCAG CGGAAGCCAC ACCACAACTA 
GACAAACTAA GGATTTATGC ^ACTGCA G AACTTGCAT CGATCAAGAA 
CGTTCGTGGG TCCATTACAC "TTTATCAT GT TCAC CAGTAC 

AGAAAATCAA GCTGGCTGGG ^TCTCAGAG ^ CAGTGCAACT 
CTGCTGCAGC CCTATGGGCA GGAGGGCCTA ^TC ccftGAATTGR 
GAATGGCCAG CCCTTAGTGA TGGTGGACGA c TCCRG TCACC 

ATGGGCTTTT GAATGTCAAT GCTTTGGCCT GCCGCTACCG 
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1900 



ATAAGCTATC CTCACACTCA -GGCCTGCT GGGCTG^T 

sssss ssssg gsss sssss sssg 

SS GTAGACATCA TATAAGGATA AAGGCATGTG ^ 2050 

TATAGAGCAA 

B TYPE: 

ri STRANDEDNESS : double 

d topology: 1 j:" e " N0 .2- 

(xi) SEQUENCE ASCRIPTION: SEC ID HO- ^ ^ 

l7i ccc atg ccc tcc agc aac 
atg agg gtg ctt tgt gcc ttc cct gaa gcc atg 

Met Arg Val Leu Cys Ala Phe Pro oiu ^ 15 

TCC CGC CC c ccc J « = s « - E S 5 S E 



™ ™ ™ i I S S T » 55 s si S S S S «| 

Leu Leu Leu His eu ^ ^ ^ 

s s s ;s ss s «. s s ~ ». «. - - 

5SSSSSS5SSSSS5S 
5! S S S K = S = S S = S S 31 5 



Glu 



80 „ r-rr TTG GTG ACC CTG GCC 

S S 2 K = S = = S = - 1 ~ - SS 3a5 

^ TXC CTC C.O SS « « « ™ »=„ S S Si SS 5 

„p n , M ». s _ „„ ^ TlT caG CTT £ ,„ 



S g S ffi 5 5 5 5 S is = ™ «• - » 

GCC CAG CAC 
Ala Gin His 



~~ n-rr rTG GAG CTC CAA AGG GAG AAG 

ATT GCC CAG CAC CCT GAT GTT ATG CTG GAG ( 

Tip Ala Gin His Pro Asp Val Met Leu l, 



l ' u . ;, r nr CAA TTC TCC AAT 610 

s E SS s ss s c s = i = s « s " a W5 
K 5 ST. K S E 25 £ S i ~ = S S S « 

s s s si 5 s = s = g = a = = i ° 

- AAC TCC TGG AAC ACT TCT AGT 745 

o Asn Asn Ser Trp Asn Ser Ser Ser 

n..r- ,rr TCC AGC AAA AAG TAC AAC ATT ">90 
GCC CTG AGT CTG TTG «G TAC AGC GCC AGC AAA ^ ^ ^ 

= ;==s====|5===s " 5 

BB sa=B=s:=l5=sas: - 
= - = =S=ssss|s=s= s ; 

S 2 S S = S IS 3 S S = S S 5 S -» 
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3 20 tt rAT III CGG GTG GTC AAG GTG 1060 

GTT ACC TGG C f CAT TGC TAG ATT GAT GGC ^ ^ Lys 

val Thr Trp Gin His Cys 34Q nQ5 

ATG GAC TTC CTG S - «C « ™ - *£ « E MP Gin 
Met Asp Phe Leu Lys Thr Arg ^ 3bu 

_ ftGG ^ A TT CAG AAA GTG GTT AAT ACA TAG ACT CCA GGA £ 
5. £ £ - Gin Lys val Val Asn Th Ty , ^ 

AAG ATT TGG CH £ - GTG GTG ACC ACC TCA GCT ^ ^ 
LyS He Trp Leu G1U Gly TGG TTG AAC 12.0 

MC AAT CTA TCC GAT TCC TAT GCT GCA GGA Phe Leu Trp Leu Asn 
Asn Asn Leu Ser Asp Ser Ty ^ «05 

f 25 S - SS E Asn £ - val Val He Arg 



^ attT CAC CTC GTG GAC CAG AAT 

, c TCA TTT TTT GAC CAT GGA TAG AA^ ^ ^ ^ Rsp Gln Asn 
.s ser Phe Phe ^ 430 ^ ^ ^ 

R TTA CCA GAC TAC TGG CTC TCT ^ ^ Ly _. ftrg 



1285 
1330 
1375 

»T TCA TTA CCA GAC TAC TGG CTC TCT 1* — -~ Arg 

12 £ Pfo S « - - ^ = t CTC «» 1,0 

CGG AAG CCA CGG CCT GGC CGA GTG ATC CGG GAC AAA CTA AGG 
Arg Ly s Pro Arg Pro Gly ^ ^ 

TAT GCT CAC TGC ACA AAC CAC CAC AAC CAC AAC ^ ^ Rrg 
Tyr Ala His cys Thr Asn x ^ ^ 

TCC ATT ACA CTT TTT ATC ATC AAC TTG CAT ^ Rrg Lys 



ser lie Thr Leu Phe e ^ ^ l6Q0 

X CTC AGA GAC AAG CTG Bi Tyr 



ATC AAG CTG GCT GGG ACT CTC AGA ^ t 

He Lys Leu Ala Gly TCA GTG 1645 

CTG CTG CAG CCC TAT « £ £ « £ £s Ser Lys Ser Val 

r««-2s«sssSssssss 1690 
^=«|=s5sssssss = s 1135 

Pro Glu Leu Lys Pro Arg Pro ^ 570 

ATC CCT CCA GTC ACC ATG GGC TTT TTT GTG GTC AAG AAT GTC 

m Pro Pro val Thr e - ^ ^ ^ S35 ^ 

GCT TTG GCC TGC CGC TAC CGA TAA GC1 

Z Z Z Z 1 Z » s s s g s s s s ii| 
IIIIISSSbSSSSss S 

TAT AGA GCA A 

... t rwrzTH: 3 - 



LENGTH : 592 
TYPE: 

STRANL 

TOPOLOGY : 



(C) STRANDEDNESS : 

(xi) s^**^"^'-^" p~ 3 »« s « t 5 

r, 1: 1: ri 2 --I"-"- 

Ser Arg 2Q Rsp Rrg Arg 

L eu Leu Leu His Leu Ser Leu Ser Ser G^ , 

Pro Leu Pro val £ Arg Ala Ala Gly Leu Lys Glu Lys Thr Leu 

pro i,eu « Thr Val Asn 

65 . c er ne He His Asp 
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85 



90 



Gly Trp Leu Asp Phe Leu Ser Ser Lys Arg Leu Val Thr Leu Ala 

Arg Gly Leu Ser Pro Ala Phe Leu Arg Phe 6 ly «, Lys Arg Thr 

hi rin Phe GJn Asn Leu Arg Asn Pro Ala Lys Ser Arg 
Asp Phe Leu Gin Phe Gin Asn ^ p 5 

Gly Gly Pro Gly Pro Asp Tyr T y r Leu Lys Asn Tyr Glu Asp Asp 

,1. val Arg Ser III Val Ala Leu Asp Lys Gin Lys Gly Cys Lys 

Ue Ala Gin His Pro Asp Val Met Leu Glu Leu Gin Arg Glu Lys 

Aia Ala Gin Met His Leu Val Leu Leu Lys Glu Gin Phe Ser Asn 

Thr Ty r ser Asn Leu He Leu Thr Ala Arg Ser Leu Asp Lys Leu 

20 <= rw Leu His Leu He Phe Ala Leu 

Tyr Asn Phe Ala Asp Cys Ser Gly Leu His u ^ 

215 „ Ti^r, nn ser Trp Asn Ser Ser Ser 

Asn Ala Leu Arg Arg Asn Pro Asn Asn ser p ^ 

Ala Leu ser Leu Leu Lys Tyr Ser Ala Ser Lys Lys Tyr Asn lie 



S er Trp Glu Leu Gly Asn Glu Pro Asn Asn Tyr Arg Thr 
Gly A rg Ala Vol | Gly Ser Gin Leu Gly Lys As P Tyr He Gin 
Le u Lys ser Leu Leu Gin Pro He Arg He Tyr Ser Arg Ala Ser 



L eu Tyr Gly Pro Asn He Gly Arg Pro Arg Lys Asn Val He Ala 

L eu Leu Asp Gly Phe Met Lys Val Ala Gly Ser Thr Val Asp Ala 

Val Thr Trp Gin His Cys Tyr He Asp Gly Arg val val Lys Val 

Met As p Phe Leu Lys Thr Arg Leu Leu Asp Thr Leu Ser Asp Gin 

Ile Arg Lys He Gin Lys Val Val Asn Thr Tyr Thr Pro Gly Lys 

Lys He Trp Leu Glu Gly Val Val Thr Thr Ser Ala Gly Gly Thr 

Asn Asn Leu Ser Asp Ser Tyr Ala Ala Gly Phe Leu Trp Leu Asn 

Thr Leu Gly Met Leu Ala Asn Gin Gly He Asp Val Val He Ar, 

„ ls Ser Phe Phe Asp His Gly Tyr Asn His Leu Val Asp Gin Asn 

Phe Asn Pro Leu Pro Asp Tyr Trp Leu Ser Leu Leu Tyr Lys Arg 

L eu lie Gly Pro Lys Val Leu Ala Val His Val Ala Gly Leu Gin 

At g Lys Pro Arg Pro Gly Arg Val He Arg Asp Lys Leu Arg IX. 

4 Z° t „ «i. His Asn His Asn Tyr Val Arg Gly 
Tyr Ala His Cys Thr Asn His His A ^ 495 

Ser lie Thr Leu Phe He He Asn Leu His Arg Ser Arg Lys Lys 

5 ?° mw ID „ A ra aso Lys Leu Val His Gin Tyr 
lie Lys Leu Ala Gly Thr Leu Arg Asp Lys ^ 

L eu Leu Gin Pro Tyr Gly Gin Glu Gly Leu Lys Ser Lys ser val 

G1 „ Leu Asn Gly III Pro Leu Val Met Val Asp Asp Gly Thr Leu 

Pro Glu Leu Lys Pro Arg Pro Leu Arg Ala Gly Arg Thr Leu Val 

U. Pro Pro val Thr Met Gly Phe Phe Val Val Lys Asn Val Asn 

Ala Leu Ala Cys Arg Tyr Arg 
590 



(71 INFORMATION FOR SEQ ID NO: 4: 

(2) INtUKBi sequence CHARACTERISTICS: 

(A) LENGTH: I 89 - . 

bJ TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEC I ID NO 4. 

cgcttaat'tc 'tagaagaggg £TGAATGAG cgtgctttgt occttcc^ 

GCTCTCTACT SSSS TCCCTTTCCT CCCAGGCTGG 
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1000 
1050 
1100 



1700 
1750 
1800 



agacaggaga cccttgcctg S agagc Jggg£%SS!S£ 

CCCTGATTCT ACTTGATGTG ^CACCAAGA ^ ATGATGGCTG 

gagaacttcc tctctctgca gctggatccg cggggacttt 

GCTCGATTTC CTAAGCTCCA AGCGCTTGGT gCC „ CCTGCAGTTC 

CGCCCGCCTT tctgcgcttc gggggcaaaa GO gcccggatta 

CAGAACCTGA GGAACCCGGC GJ^CC" ^CTAGACAAA CTTTATAACT 
CTATCTCAAA AACTATGAGG ^TGCCAGGTC 1 TGCACTGCGT 
TTGCTGATTG CTCTGGACTC CACCTGATAT JTBL^ GTCTGTTGAA 
CGTAATCCCA ATAACTCCTG ^AACAGTTCT ^G^^ GGTAATGRG c 
GTACAGCGCC AGCAAAAAGT ACAACAI 1 ^ TGG CAGCCAGTTG 
CAAATAACTA TCGGACCATG CATGGCCGGG "GTAAA TCCGGATTTA 
GGAAAGGATT ACATCCAGCT GAAGAGCCTG ™ Q AGGAAGA ATG 

TTCCAGAGCC AGCTTATATG ^CCTAATAT 1 <. -pACAGTAGAT 

TCATCGCCCT CCTAGATGGA "CATGAAGG G TCAAGGTGA T 

GCAGTTACCT GGCAACATTG CTACATTGAT CTGRC CAGATTAGGA 

GGACTTCCTG AAAACTCGCC TGTTAGALA^ QAA G ATTTGGCTT 

AAATTCAGAA AGTGGTTAAT ACATACRCTC ^A TATCCGATTC 
GAAGGTGTGG TGACCACCTC AGCTGGAGGC CTGGCCAATC 
CTATGCTGCA GGATTCTTAT GGTTGAACAC ™ TGGATACAAT 
AGGGCATTGA TGTCGTGATA CGGCACTCAT ^ GGCTCTCTCT 
CACCTCGTGG ACCAGAATTT TAACCCATTA CCAGA 

CCTCTACAAG CGCCTGATCG ^CCCCAAAGT A CAAAcT AAGG 

GGCTCCAGCG GAAGCCACGG "TGGCCGAG G TTCG TGGGTC 

ATTTATGCTC ACTGCACAAA CCACCACAAC ^ AAAATCAAGC 
CATTACACTT TTTATCATCA ACC AGTACCT GCTGCAGCCC 

TGGCTGGGAC TCTCAGAGAC ^CTGGTTC ATGGCCAGCC 
TATGGGCAGG AGGGCCTAAA GTCCAAGTCA ^ CCC CGCCCCC 
CTTAGTGATG GTGGACGACG GGACCCTCCC ^ GGGCTTTTTT 
TTCGGGCCGG CCGGACATTG GTCATCCCTC £ GRT AAGCTATCCT 

GTGGTCAAGA ATGTCAATGC TTTGGCCTGO ^ CTCC TCCACT 

CACACTCATG GCTACCAGTG GGCCTGCTGG GC CCCTGCTGCC 

SSESS SS tccatcccaa aggaaaatgt .. 
K3SS S SSSSSS HSSK tagagcaa 1898 

, A) LENGTH: 538 

B) TYPE: amino acld 

CI STRANDEDNESS: single 

jo) topology: ij;" e " -5- 

(xi , SEQUENCE DESCRIPTION: SEQ ID NO. 5. 
Met Arg Val Leu Cys Ala Phe Pro Glu Ala Met Pro Ser Ser Asn 
SeE Arg Pro Pro Ala Cys feu Ala Pro Gly Ala Leu Tyr Leu Ala 
Leu Leu Leu His Leu Ser Leu Ser Ser Gin Ala Gly Asp Arg Arg 
Pro Leu Pro Val Asp Arg Ala Ala Gly Leu Lys Glu Lys Thr Leu 

a n vfl Ser Thr Lys Asn Pro Val Arg Thr Val Asn 

lie Leu Leu Asp Val Ser mi uy 7q 75 

J 5 ,»„ rin Leu Asp Pro Ser He He His Asp 

Glu Asn Phe Leu Ser Leu Gin Leu Asp g0 

Gly Tr P Leu Asp Phe L.- Ser Ser Lys Arg Leu Val Thr Leu Ala 
Arg Gly Leu Ser Pro Ala Phe Leu Arg Phe Gly Gly Lys Arg Thr 
Asp Phe Leu Gin Phe Gin Asn Leu Arg Asn Pro Ala Lys Ser Arg 
Gly Gly Pro Gly III As P Tyr Tyr Leu Lys Asn Tyr Glu Asp Ala 
Arg Ser Leu Asp Lys Leu Tyr Asn Phe Ala Asp Cys Ser Gly Leu 
His Leu lie Phe Ala Leu Asn Ala Leu Arg Arg Asn Pro Asn Asn 
ser Trp Asn ser III Ser Ala Leu Ser Leu Leu Lys Tyr Ser Ala 
185 _ , Rsn Glu Pro Asn 

Ser Lys Lys Tyr Asn He Ser Trp Glu Gly 
Asn Tyr Arg Thr III His Gly Arg Ala Val Asn Gly Ser Gin Leu 
Gly Lys Asp Tyr III Gin Leu Lys Ser Leu Leu Gin Pro He Arg 
Ile T yr ser Arg III Ser Leu Tyr Gly Pro Asn He Gly Arg Pro 
Ar g Lys Asn Val He Ala Leu Leu As P Gly PheKet Lys Val Ala 
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Gly Ser Thr Val Asp Ala Val Thr Trp Gin His Cys Tyr He Asp 
Gly Arg Val Val Lys Val Met Asp Phe Leu Lys Thr Arg Leu Leu 
Asp Thr Leu Ser Ala Gin He Arg Lys lie Gin Lys Val Val Asn 
Th r Tyr Thr Pro III Lys Lys He Trp Leu Glu Gly Val Val Thr 
Thr Ser Ala Gly Gly Thr Asn Asn Leu Ser Asp Ser Tyr Ala Ala 
Gly Phe Leu Trp Leu Asn Thr Leu Gly Met Leu Ala Asn Gin Gly 



lie Asp val val He Arg I 



s Ser Phe Phe Asp His Gly Tyr Asn 



, Leu Val Asp Gin Asn Phe Asn Pro Leu Pro Asp Tyr Trp Le, 
Ser Leu Leu Tyr Lys Arg Leu lie Gly Pro Lys Val Leu Ala Val 
His val Ala Gly lJu Gin Arg Lys Pro Arg Pro Gly Arg Val He 
Arg Asp Lys Leu Arg He Tyr Ala His Cys Thr Asn His His Asn 
His Asn Tyr Val Arg Gly Ser He Thr Leu Phe He He Asn Leu 
Gly Thr Leu Arg Asp 



His Arg Ser Arg Lys Lys He Lys Leu Ala 
Lys Leu Val His Gin Tyr " 



Tyr Gly Gin C 



Leu Lys Ser Lys Ser Val Gin Leu Asn Gly Gin Pro Leu Val Met 

Val Asp Asp Gly Tnr Leu Pro Glu Leu Lys Pro Arg Pro Leu Arg 

500 = u: > 

Ala Gly Arg Thr Leu 



val lie Pro Pro Val Thr Met Gly Phe Phe 



val Val Lys Asn Val Asn Ala Leu Ala Cys Arg Tyr Arg 
530 535 

21 INFORMATION FOR SEQ ID NO: 6: 

' (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1724 

<B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6: 

GCTCGATTTC CTAAGCTCCA AGCGCTTGGT GACCCTGGCC CGGGGACTTT 350 
^TCGATTTC CTAAW£, QGGGGCAAAA GGACCGACTT CCTGCAGTTC 400 

GCCGGGCAGT r-sTTTATTfr AGAGCCAGCT TATATGGCCC 600 

■■■■■ I 

mmmmM i 

" T „ rarCA GTACCTGCTG CAGCCCTATG GGCAGGAGGG CCTAAAGTL.L. 
HgtcIgtcc AACTGAATCG CCAGCCCTTA GTGATGGTGG ACGACGGGAC 
C^TCCCAGAA TTGAAGCCCC GCCCCCTTCG GGCCGGCCGG ACATTGGTCA 
"TCCCAGAA TTGAAGWA. TCAAGAATGT CAATGCTTTG 

gcctgccgct accgataagc tatcctcaca ctcatggcta ccagtgggcc 

TGCTCGGCTG CTTCCACTCC TCCACTCCAG TAGTATCCTC TGTTTTCAGA 
CATCCTOGCA ACCAGCCCCT GCTGCCCCAT CCTGCTGGAA tcaacacaga 
CTTGCTCTCC AAAGAGACTA AATGTCATAG CGTGATCTTA GCCTAGGTAG 
GCCACATCCA TCCCAAAGGA AAATGTAGAC ATCACCTGTA CCTATATAAG 
GATAAAGGCA TGTGTATAGA GCAA 



1350 
1400 

1500 
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(2) INFORMATION FOR SEQ ID NO: 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 4 80 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
Met Arg Val Leu Cys Ala Phe Pro Glu Ala Met Pro Ser Ser Asn 

Ser Arg Pro Pro Ala Cys Leu Ala Pro Gly Ala Leu Tyr Leu Ala 

Leu Leu Leu His Leu Ser Leu Ser Ser Gin Ala Gly Asp Arg Arg 
35 40 4= 

Pro Leu Pro Val Asp Arg Ala Ala Gly Leu Lys Glu Lys Thr Leu 
50 55 

lie Leu Leu Asp Val Ser Thr Lys Asn Pro Val Arg Thr Val Asn 
65 70 

Glu Asn Phe Leu Ser Leu Gin Leu Asp Pro Ser He He His Asp 

Gly Trp Leu Asp Phe Leu Ser Ser Lys Arg Leu Val Thr Leu Ala 
Arg Gly Leu Ser Pro Ala Phe Leu Arg Phe Gly Gly Lys Arg Thr 
HO 115 e ~ ° 

Asp Phe Leu Gin Phe Gin Asn Leu Arg Asn Pro Ala Lys Ser Arg 

Gly Gly Pro Gly Pro Asp Tyr Tyr Leu Lys Asn Tyr Glu Asp Glu 

Pro Asn Asn Tyr Arg Thr Met His Gly Arg Ala Val Asn Gly Ser 
155 I 60 ib:> 

Gin Leu Gly Lys Asp Tyr He Gin Leu Lys Ser Leu Leu Gin Pro 

lie Arg He Tyr Ser Arg Ala Ser Leu Tyr Gly Pro Asn He Gly 

Arg Pro Arg Lys Asn Val He Ala Leu Leu Asp Gly Phe Met Lys 
200 205 dlu 

val Ala Gly Ser Thr Val Asp Ala Val Thr Trp Gin His Cys Tyr 

He Asp Gly Arg Val Val Lys Val Met Asp Phe Leu Lys Thr Arg 
Leu Leu Asp Thr Leu Ser Asp Gin He Arg Lys He Gin Lys Val 
245 250 255 

Val Asn Thr Tyr Thr Pro Gly Lys Lys lie Trp Leu Glu Gly Val 

val Thr Thr Ser Ala Gly Gly Thr Asn Asn Leu Ser Asp Ser Tyr 
275 280 2 " 

Ala Ala Gly Phe Leu Trp Leu Asn Thr Leu Gly Met Leu Ala Asn 
290 295 ' 3UU 

Gin Gly He Asp Val Val He Arg His Ser Phe Phe Asp His Gly 
305 31° 

Tyr Asn His Leu Val Asp Gin Asn Phe Asn Pro Leu Pro Asp Tyr 

Trp Leu Ser Leu Leu Tyr Lys Arg Leu He Gly Pro Lys Val Leu 

Ala Val His Val Ala Gly Leu Gin Arg Lys Pro Arg Pro Gly Arg 

350 355 360 

Val He Arg Asp Lys Leu Arg He Tyr Ala His Cys Thr Asn His 

365 370 J /3 

His Asn His Asn Tyr Val Arg Gly Ser He Thr Leu Phe He He 

380 385 390 

Asn Leu His Arg Ser Arg Lys Lys He Lys Leu Ala Gly Thr Leu 

395 400 405 

Arg Asp Lys Leu Val His Gin Tyr Leu Leu Gin Pro Tyr Gly Gin 

Glu Gly Leu Lys Ser Lys Ser Val Gin Leu Asn Gly Gin Pro Leu 
425 430 435 

Val Met Val Asp Asp Gly Thr Leu Pro Glu Leu Lys Pro Arg Pro 
440 445 «" 

Leu Arg Ala Gly Arg Thr Leu Val He Pro Pro Val Thr Met Gly 

Phe Phe val Val Lys Asn Val Asn Ala Leu Ala Cys Arg Tyr Arg 

470 475 4BU 

(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 351 

(B) TYPE: amino acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 
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GTTCGGCAGA GGATCATGTC TGATGTACAG ^ACATTGTC 8 CGGAGTGATG 50 

BSIiBi 

ATTTCTTGGG 350 

(A) LENGTH: 543 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N°:9: 
Met Leu Leu Arg Ser Lys Pro Ala Leu Pro Pro Pro Leu Met Leu 
Leu Leu Gly Pro Leu Gly Pro Leu Ser Pro Gly Ala Leu Pro Arg Pro 
Ala Gin Ala Gin Asp Val V.l Asp Leu Asp Phe Phe Thr Gin Glu Pro 
Leu His Leu Val Ser Pro Ser Phe Leu Ser Val Thr lie Asp Ala Asn 
Leu Ala Thr Asp Pro Arg Phe Leu He Leu Leu Gly Ser Pro Lys Leu 
Arg Thr Leu Ala Arg Gly Leu Ser Pro Ala Tyr Leu Arg Phe Gly Gly 
Thr Lys Thr Asp Phe Leu He Phe Asp Pro Lys Lys Glu Ser Thr Phe 
Glu Glu Arg Ser Tyr Tr P Gin Ser Gin Val Asn Gin Asp He Cys Lys 
Tyr Gly III He Pro Pro Asp Val Glu Glu Lys Leu Arg Leu Glu Trp 
Pro Tyr Gin Glu Gin Leu Leu Leu Arg Glu Hi. Tyr Gin Lys Lys Phe 
Lys Asn Ser Thr Tyr Ser Arg Ser Ser Val Asp Val Leu Tyr Thr Phe 
Ala Asn Cys Ser Gly Leu Asp Leu lie Phe Gly Leu Asn Ala Leu Leu 
Arg Thr Ala As" Leu Gin Tr P Asn Ser Ser Asn Ala Gin Leu Leu Leu 
Asp Tyr cys Ser Ser Lys Gly Tyr Asn He Ser Trp Glu Leu Gly Asn 
Glu III Asn Ser Phe Leu Lys Lys Ala Asp He Phe He Asn Gly Ser 
III Leu Gly Glu Asp Tyr He Gin Leu His Lys Leu Leu Arg Lys Ser 
Thr Phe Lys Asn III Lys Leu Tyr Gly Pro Asp Val Gly Gin Pro Arg 
Arg Lys Thr Ala Lys Met Leu Lys Ser Phe Leu Lys Ala Gly Gly Glu 
val He Asp Ser Val Thr Trp His His Tyr Tyr Leu Asn Gly Arg Thr 
Ala Thr Arg Glu Asp Phe III Asn Pro Asp Val Leu Asp He Phe II. 
Ill ser val Gin Lys Val Phe Gin Val Val Glu Ser Thr Arg Pro Gly 
Lys Lys val Trp III Gly Glu Thr Ser Ser Ala Tyr Gly Gly Gly Ala 
Pro Leu Leu Ser Asp Thr Phe Ala Ala Gly Phe Met Trp Leu Asp Lys 
Leu Gly III Ser Ala Arg Met Gly He Glu Val Val Met Arg Gin Val 
Phe III Gly Ala Gly Asn Tyr His Leu Val >sp Glu Asn Phe Asp Pro 
III Pro Asp Tyr Trp III Ser Leu Leu Phe Lys Lys Leu Val Gly Thr 
Lys Val Leu Met "a Ser Val Gin Gly^Ser Lys Arg Arg LysL-eu Arg 
Val Tyr Leu Hifcys Thr Asn Thr Asp Asn Pro Arg Tyr Lys Glu Gly 
Asp Leu Thr Leu Tyr Ala lie Asn Leu His Asn Val Thr Lys Tyr Leu 
Arg III Pro Tyr Pro Phe III Asn Lys Gin Val Asp Lys Tyr Leu Leu 

465 , „ r-w, i pi, Leu Ser Lys Ser Val Gin Leu Asn 

Arg Pro Leu Gly Pro His Gly Leu Leu ser fcys s ^ 

Gly Leu Thr Leu Lys Met Val Asp Asp Gin Thr Leu Pro Pro Leu Met 



WO 01/00643 



PCT/IL00/00358 



Glu Lys Pro Leu Arg Pro Gly Ser Ser Leu Gly Leu Pro Ala Phe Ser 

515 520 525 

Tyr Ser Phe Phe Val lie Arg Asn Ala Lys Val Ala Ala Cys He 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 23 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GGAGAGCAAG TCTGTGTTGA TTC 23 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 

(B> TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
CACTGGTAGC CATGAGTGTG AG 22 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 22 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(D) 



SEQ ID NO:12: 



INFORMATION FOR SEQ ID NO: 13: 

(X) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 23 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 
CTTGCCTGTA GACAGAGCTG CAG 23 

(2) INFORMATION FOR SEQ ID NO:15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 2396 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 
TTTCTAGTTG CTTTTAGCCA ATGTCGGATC AGGTTTTTCA AGCGACAAAG 
AGATACTGAG ATCCTGGGCA GAGGACATCC TAGCTCGGTC AGATTTGGGC 
AGGCTCAAGT GACCAGTGTC TTAAGGCAGA AGGGAGTCGG GGTAGGGTCT 
GGCTGAACCC TCAACCGGGG CTTTTAACTC AGGGTCTAGT CCTGGCGCCA 
AATGGATGGG ACCTAGAAAA GGTGACAGAG TGCGCAGGAC ACCAGGAAGC 
TGGTCCCACC CCTGCGCGGC TCCCGGGCGC TCCCTCCCCA GGCCTCCGAG 
GATCTTGGAT TCTGGCCACC TCCGCACCCT TTGGATGGGT GTGGATGATT 
TCAAAAGTGG ACGTGACCGC GGCGGAGGGG AAAGCCAGCA CGGAAATGAA 
AGAGAGCGAG GAGGGGAGGG CGGGGAGGGG AGGGCGCTAG GGAGGGACTC 
CCGGGAGGGG TGGGAGGGAT GGAGCGCTGT GGGAGGGTAC TGAGTCCTGG 
C GCCAGAGGC GAAGCAGGAC CGGTTGCAGG GGGCTTGAGC CAGCGCGCCG 
GCTGCCCCAG CTCTCCCGGC AGCGGGCGGT CCAGCCAGGT GGGATGCTGA 
GGCTGCTGCT GCTGTGGCTC TGGGGGCCGC TCGGTGCCCT GGCCCAGGGC 
GCCCCCGCGG GGACCGCGCC GACCGACGAC GTGGTAGACT TGGAGTTTTA 
CACCAAGCGG CCGCTCCGAA GCGTGAGTCC CTCGTTCCTG TCCATCACCA 
TCGACGCCAG CCTGGCCACC GACCCGCGCT TCCTCACCTT CCTGGGCTCT 
CCAAGGCTCC GTGCTCTGGC TAGAGGCTTA TCTCCTGCAT ACTTGAGATT 
TGGCGGCACA AAGACTGACT TCCTTATTTT TGATCCGGAC AAGGAACCGA 
CTTCCGAAGA AAGAAGTTAC TGGAAATCTC AAGTCAACCA TGATATTTGC 
AGGTCTGAGC CGGTCTCTGC TGCGGTGTTG AGGAAACTCC AGGTGGAATG 
GCCCTTCCAG GAGCTGTTGC TGCTCCGAGA GCAGTACCAA H 
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AGAACAGCAC CTACTCAAGA AGCTCAGTGG ACATGCTCTA CAGTTTTGCC 1100 

AAGTGCTCGG GGTTAGAGCT GATCTTTGGT CTAAATGCGT TACTACGAAC 1150 

CCCAGACTTA CGGTGGAACA GcTCCAACGC CCAGCTTCTC CTTGACTACT 1200 

GCTCTTCCAA GGGTTATAAC ATcTCCTGGG AACTGGGCAA TGAGCCCAAC 12 50 

AGTTTcTGGA AGAAAGCTCA CATTCTCATC GATGGGTTGC AGTTAGGAGA 1300 

AGACTTTGTG GAGTTGCATA AACTTcTACA AAGGTCAGCT TTCCAAAATG 1350 

CAAAACTCTA TGGTCCTGAC ATCGGTCAGC CTCGAGGGAA GACAGTTAAA 1400 

CTGCTGAGGA GTTTCCTGAA GGCTGGCGGA GAAGTGATCG ACTCTCTTAC 14 50 

ATGGCATCAC TATTACTTGA ATGGACGCAT CGCTACCAAA GAAGATTTTC 1500 

TGAGCTCTGA TGCGCTGGAC ACTTTTATTC TCTCTGTGCA AAAAATTCTG 1550 

AAGGTCACTA AAGAGATCAC ACCTGGCAAG AAGGTCTGGT TGGGAGAGAC 1600 

GAGCTCAGCT TACGGTGGCG GTGCACCCTT GCTGTCCAAC ACCTTTGCAG 1650 

CTGGCTTTAT GTGGCTGGAT AAATTGGGCC TGTCAGCCCA GATGGGCATA 1700 

GAAGTCGTGA TGAGGCAGGT GTTCTTCGGA GCAGGCAACT ACCACTTAGT IT 50 

GGATGAAAAC TTTGAGCCTT TACCTGATTA CTGGCTCTCT CTTCTGTTCA 1800 

AGAAACTGGT AGGTCCCAGG GTGTTACTGT CAAGAGTGAA AGGCCCAGAC 1850 

AGGAGCAAAC TCCGAGTGTA TCTCCACTGC ACTAACGTCT ATCACCCACG 1900 

ATATCAGGAA GGAGATCTAA CTCTGTATGT CCTGAACCTC CATAATGTCA 1950 

CCAAGCACTT GAAGGTACCG CCTCCGTTGT TCAGGAAACC AGTGGATACG 2000 

TACCTTCTGA AGCCTTCGGG GCCGGATGGA TTACTTTCCA AATCTGTCCA 2050 

ACTGAACGGT CAAATTCTGA AGATGGTGGA TGAGCAGACC CTGCCAGCTT 2100 

TGACAGAAAA ACCTCTCCCC GCAGGAAGTG CACTAAGCCT GCCTGCCTTT 2150 

TCCTATGGTT TTTTTGTCAT AAGAAATGCC AAAATCGCTG CTTGTATATG 2200 

AAAATAAAAG GCATACGGTA CCCCTGAGAC AAAAGCCGAG GGGGGTGTTA 2250 

TTCATAAAAC AAAACCCTAG TTTAGGAGGC CACCTCCTTG CCGAGTTCCA 2300 

GAGCT TC GGG AGGGTGGGGT ACACTTCAGT ATTACATTCA GTGTGGTGTT 2350 

CTCTCTAAGA AGAATACTGC AGGTGGTGAC AGTTAATAGC ACTGTG 2 39 6 



(C) STRANDEDNESS: single 

(D> TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GAGCAGCCAG GTGAGCCCAA GA 22 

(2) INFORMATION FOR SEQ ID NO : 17 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 24 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: 
TCAGATGCAA GCAGCAACTT TGGC 24 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 21 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
CACCCTGATG TCATGCTGGA G 2 1 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CATCTAGGAG AGCAATGACG TTC 23 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 
CCATCCTAAT ACGACTCACT ATAGGGC 27 

(2) INFORMATION FOR SEQ ID NO:21: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 23 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(2) INFORMATION FOR SEQ ID NO:22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: ■ 
TTTTTTTTTT TTTTT 15 

(2) INFORMATION FOR SEQ ID NO:23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 560 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23 
GGCACGAGGC TAG TGG AGAG ACTGACAAGC AGTCAGCTCA GCGGTCACAA 50 
TACTGTGTGA CAGGAGCTGA GATCCAAGAA GTACTGGGTC CTGTGGGAGC 100 
ACCCCTGACT TGAAGGACAA GTCAGTGCAA CTGAATGGCC AGCCCTTAGT 150 
GAT GGTGGAC GACGGGACCC TCCCAGAATT GAAGCCCCGC CCCCTTCGGG 200 
CCGGCCGGAC ATTGGTCATC CCTCCAGTCA CCATGGGCTT TTTTGTGGTC 250 
AAGAATGTCA ATGCTTTGGC CTGCCGCTAC CGATAAGCTA TCCTCACACT 300 
CATGGCTACC AGTGGGCCTG CTGGGCTGCT TCCACTCCTC CACTCCAGTA 350 
GTATCCTCTG TTTTCAGACA TCCTAGCAAC CAGCCCCTGC TGCCCCATCC 400 
TGCTGGAATC AACACAGACT TGCTCTCCAA AGAGACTAAA TGTCATAGCG 4 50 
TGATCTTAGC CTAGGTAGGC CACATCCATC CCAAAGGAAA ATGTAGACAT 500 
CACCTGTACC TATATAAGGA TAAAGGCATG TGTATAGAGC AAAAAAAAAA 550 
AAAAAAAAAA 560 

(2) INFORMATION FOR SEQ ID NO: 24 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1721 

(B) TYPE: nucleic acid 
(C) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
CTAGAGCTTT CGACTCTCCG CTGCGCGGCA GCTGGCGGGG GGAGCAGCCA GGTGAGCCCA 60 
AGATGCTGCT GCGCTCGAAG CCTGCGCTGC CGCCGCCGCT GATGCTGCTG CTCCTGGGGC 120 
CGCTGGGTCC CCTCTCCCCT GGCGCCCTGC CCCGACCTGC GCAAGCACAG GACGTCGTGG 180 
ACCTGGACTT cTTCACCCAG GAGCCGCTGC ACCTGGTGAG CCCCTCGTTC CTGTCCGTCA 24 0 
CCATTGACGC CAACCTGGCC ACGGACCCGC GGTTCCTCAT CCTCCTGGGT TCTCCAAAGC 300 
TTCGTACCTT GGCCAGAGGC TTGTCTCCTG CGTACCTGAG GTTTGGTGGC ACCAAGACAG 360 
ACTTCCTAAT TTTCGATCCC AAGAAGGAAT CAACCTTTGA AGAGAGAAGT TACTGGCAAT 420 
CTCAAGTCAA CCAGGATATT TGCAAATATG GATCCATCCC TCCTGATGTG GAGGAGAAGT 480 
TACGGTTGGA ATGGCCCTAC CAGGAGCAAT TGCTACTCCG AGAACACTAC CAGAAAAAGT 54 0 
TC AAGAAC AG CACCTACTCA AGAAGCTCTG TAGATGTGCT ATACACTTTT GCAAACTGCT 600 
CAGGACTGGA CTTGATCTTT GGCCTAAATG CGTTATTAAG AACAGCAGAT TTGCAGTGGA 660 
ACAGTTCTAA TGCTCAGTTG CTCCTGGACT ACTGCTCTTC CAAGGGGTAT AACATTTCTT 720 
GGGAACTAGG CAATGAACCT AACAGTTTCC TTAAGAAGGC TGATATTTTC ATCAATGGGT 780 
CGCAGTTAGG AGAAGATTAT ATTCAATTGC ATAAACTTCT AAGAAAGTCC ACCTTCAAAA 640 
ATGCAAAACT CTATGGTCCT GATGTTGGTC AGCCTCGAAG AAAGACGGCT AAGATGCT G A 900 
AGAGCTTCCT GAAGGCTGGT GGAGAAGTGA TTGATTCAGT TACATGGCAT CACTACTATT 960 
TGAATGGACG GACTGCTACC AGGGAAGATT TTCTAAACCC TGATGTATTG GACATTTTTA 1020 
TTTCATCTGT GCAAAAAGTT TTCCAGGTGG TTGAGAGCAC CAGGCCTGGC AAGAAGGTCT 1080 
GGTTAGGAGA AACAAGCTCT GCATATGGAG GCGGAGCGCC CTTGCTATCC GACACCTTTG 1140 
CAGCTGGCTT TATGTGGCTG GATAAATTGG GCCTGTCAGC CCGAATGGGA ATAGAAGTGG 1200 
TGATGAGGCA AGTATTCTTT GGAGCAGGAA ACTACCATTT AGTGGATGAA AACTTCGATC 1260 
CTTTACCTGA TTATTGGCTA TCTCTTCTGT TCAAGAAATT GGTGGGCACC AAGGTGTTAA 1320 
TGGCAAGCGT GCAAGGTTCA AAGAGAAGGA AGCTTCGAGT ATACCTTCAT TGCACAAACA 1380 
CTGAC AATC C AAGGTATAAA GAAGGAGATT TAACTCTGTA TGCCATAAAC CTCCATAACG 1440 
TCACCAAGTA CTTGCGGTTA CCCTATCCTT TTTCTAACAA GCAAGTGGAT AAATACCTTC 1500 
TAAGACCTTT GGGACCTCAT GGATT AC TTT CCAAATCTGT CCAACTCAAT GGTCTAACTC 1560 
TAAAGATGGT GGATGATCAA ACCTTGCCAC CTTTAATGGA AAAACCTCTC CGGCCAGGAA 1620 
GTTCACTGGG CTTGCCAGCT TTCTCATATA GTTTTTTTGT GATAAGAAAT GCCAAAGTTG 1680 
CTGCTTGCAT CT GAAAATAA AATATACTAG TCCTGACACT G 1721 

(2) INFORMATION FOR SEQ ID NO:25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 45 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
CTTACTTGTC ATCGTCGTCC TTGTAGTCTC GGTAGCGGCA GGCCA 4 5 



